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TITLE 

TRYPTOPHAN BIOSYNTHETIC ENZYMES 

This application claims the benefit of U.S. Provisional Application No. 60/079,386, 
filed March 26, 1998. 
5 FIELD OF THE INVENTION 

This invention is in the field of plant molecular biology. More specifically, this 
invention pertains to nucleic acid fragments encoding enzymes involved in the biosynthesis 
of tyrotophan in plants and seeds. 

BACKGROUND OF THE INVENTION 
10 Many vertebrates, including man, lack the ability to manufacture a number of amino 

acids and therefore require these amino acids preformed in their diet. These are called 
essential amino acids. Plants are able to synthesize all twenty amino acids and serve as the 
ultimate source of the essential amino acids for humans and animals. Thus, the ability to 
manipulate the production and accumulation of the essential amino acids in plants is of 
15 considerable importance and value. Furthermore, the inability of animals to synthesize these 
amino acids provides a useful distinction between animal and plant cellular metabolism. 
This can be exploited for the discovery of herbicidal chemical compounds that target 
enzymes in the plant biosynthetic pathways of the essential amino acids and thus have low 
toxicity to animals. 

20 Tryptophan is an essential amino acid. In plants, the biosynthesis of tryptophan from 

chorismic acid (see Figure 1) requires five enzymatic steps catalyzed by anthranilate 
synthase (EC 4.1.3.27), anthranilate phosphoribosyl-transferase (EC 2.4.2.18), 
phosphoribosylanthranilate isomerase (EC 5.3.1.24), indole-3-glycerol phosphate synthase 
(EC 4.1.1 .48) and tryptophan synthase (EC 4.2. 1 .20). The tryptophan pathway leads to the 

25 biosynthesis of many secondary metabolites including the hormone indole-3 -acetic acid, 

antimicrobial phytoalexins, alkaloids and glucosinolates. The first enzyme in the tryptophan 
pathway from chorismate is anthranilate synthase. This enzyme is subject to feedback 
inhibition by tryptophan and is composed of two subunits. Even though the anthranilate 
synthase alpha subunits are encoded by duplicate genes, in Arabidopsis and Ruta graveolens 

30 it has been shown that only one gene is induced in response to wounding or pathogen attack 
(Bohlmann, J. et al. (1995) Plant! 7:491-501 and Niyogi, K. K. and Fink, G. R. (1992) 
Plant Cell 4:721-733). Glutamine- and ammonia-dependent anthranilate synthase activities 
copurify in Ruta graveolens. The glutamine-dependent reaction of one of the alpha subunits 
requires a 60 to 65 kDa anthranilate synthase beta subunit (Bohlmann, J. et al. (1995) Plant 

35 J. 7:491-501) The beta-subunit is a highly asymmetric dimer with an apparent molecular 
weight of 200,000. Cleavage of the purified subunit with elastase, trypsin, or chymotrypsin 
results in fragments which retain enzyme activity. Elastase digestion results in a 30 kDa 
fragment and a 56 kDa fragment. The first fragment behaves as a monomer and interacts 
with free alpha subunit to produce the glutamine-dependent anthranilate synthase activity. 
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The second fragment behaves as an asymmetric dimer and has N-(5'-phosphoribosyl) 
anthraniiate isomerase and indole-3-glycerol phosphate synthase activity (Walker, M. S. and 
DeMoss, J. A. (1983) 75/^/ Chem 255:3571-3575). 

The two final reactions in tryptophan biosynthesis are catalyzed by tryptophan 
5 synthase. The 29 kDa alpha subunit is a bifunctional enzyme which cleaves indole-3- 
glycerol phosphate to produce indole and glyceraldehydeO -phosphate. The beta subunit 
joins indole with serine to form tryptophan. Either subunit alone is enzymatically active, but 
the rate of the reaction and affinity for the substrates increases when the subunits are forming 
a tetramer composed of two alpha subunits and two beta subunits (Radwanski, E. R. (1995) 
10 Mol Gen Genet 2^5:657-667). 

Few of the genes encoding enzymes fi-om the tryptophan pathway in com, soybeans, 
rice and wheat, have been isolated and sequenced. For example, no com, soybean, rice or 
wheat genes have been reported for anthraniiate synthase alpha or beta subunits and no 
soybean, rice or wheat genes have been reported for tryptophan synthase alpha subunit. A 
15 com gene encoding tryptophan synthase has been identified and the instant invention 

describes a new com tryptophan synthase homolog. Accordingly, the availability of nucleic 
acid sequences encoding all or a portion of these enzymes will facilitate studies to better 
understand cellular biosynthetic pathways, provide genetic tools for the manipulation of 
those pathways, provide a means to evaluate chemical compounds for their ability to inhibit 
20 the enzymatic activity of the compounds included in this application. 

SUMMARY OF THE INVENTION 
The instant invention relates to isolated nucleic acid fragments encoding tryptophan 
biosynthetic enzymes. Specifically, this invention concerns an isolated nucleic acid 
fragment encoding an anthraniiate synthase alpha subunit, an anthraniiate synthase beta 
25 subunit or a tryptophan synthase alpha subunit. In addition, this invention relates to a 
nucleic acid fragment that is complementary to the nucleic acid fragment encoding 
anthraniiate synthase alpha subunit, anthraniiate synthase beta subunit or tryptophan 
synthase alpha subunit. 

An additional embodiment of the instant invention pertains to a polypeptide encoding 
30 all or a substantial portion of a tryptophan biosynthetic enzyme selected from the group 
consisting of anthraniiate synthase alpha subunit, anthraniiate synthase beta subunit and 
tryptophan synthase alpha subunit. 

In another embodiment, the instant invention relates to a chimeric gene encoding an 
anthraniiate synthase alpha subunit, an anthraniiate synthase beta subunit or a tryptophan 
35 synthase alpha subunit, or to a chimeric gene that comprises a nucleic acid fi-agment that is 
complementary to a nucleic acid fragment encoding an anthraniiate synthase alpha subunit, 
an anthraniiate synthase beta subunit or a tryptophan synthase alpha subunit, operably linked 
to suitable regulatory sequences, wherein expression of the chimeric gene results in 
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production of levels of the encoded protein in a transformed host cell that is altered (i.e., 
increased or decreased) from the level produced in an untransformed host cell. 

In a further embodiment, the instant invention concerns a transformed host cell 
comprising in its genome a chimeric gene encoding an anthranilate synthase alpha subunit, 
5 an anthranilate synthase beta subunit or a tryptophan synthase alpha subunit, operably linked 
to suitable regulatory sequences. Expression of the chimeric gene results in production of 
altered levels of the encoded protein in the transformed host cell. The transformed host cell 
can be of eukaryotic or prokaryotic origin, and include cells derived from higher plants and 
microorganisms. The invention also includes transformed plants that arise from transformed 

10 host cells of higher plants, and seeds derived from such transformed plants. 

An additional embodiment of the instant invention concerns a method of altering the 
level of expression of an anthranilate synthase alpha subunit, an anthranilate synthase beta 
subunit or a tryptophan synthase alpha subunit in a transformed host cell comprising: 
a) transforming a host cell with a chimeric gene comprising a. nucleic acid fragment 

15 encoding an anthranilate synthase alpha subunit, an anthranilate synthase beta subunit or a 
tryptophan synthase alpha subunit; and b) growing the transformed host cell under 
conditions that are suitable for expression of the chimeric gene wherein expression of the 
chimeric gene results in production of altered levels of anthranilate synthase alpha subunit, 
anthranilate synthase beta subunit or tryptophan synthase alpha subunit in the transformed 

20 host cell. 

An addition embodiment of the instant invention concerns a method for obtaining a 
nucleic acid fragment encoding all or a substantial portion of an amino acid sequence 
encoding an anthranilate synthase alpha subunit, an anthranilate synthase beta subunit or a 
tryptophan synthase alpha subunit. 

25 A further embodiment of the instant invention is a method for evaluating at least one 

compound for its ability to inhibit the activity of an anthranilate synthase alpha subunit, an 
anthranilate synthase beta subunit or a tryptophan synthase alpha subunit, the method 
comprising the steps of: (a) transforming a host cell with a chimeric gene comprising a 
nucleic acid fragment encoding an anthranilate synthase alpha subtmit, an anthranilate 

30 synthase beta subunit or a tryptophan synthase alpha subunit, operably linked to suitable 

regulatory sequences; (b) growing the transformed host cell under conditions that are suitable 
for expression of the chimeric gene wherein expression of the chimeric gene results in 
production of anthranilate synthase alpha subunit, anthranilate synthase beta subunit or 
tryptophan synthase alpha subunit in the transformed host cell; (c) optionally purifying the 

35 anthranilate synthase alpha subunit, the anthranilate synthase beta subunit or the tryptophan 
synthase alpha subunit expressed by the transformed host cell; (d) treating the anthranilate 
synthase alpha subunit, the anthranilate synthase beta subunit or the tryptophan synthase 
alpha subunit with a compound to be tested; and (e) comparing the activity of the 
anthranilate synthase alpha subunit, the anthranilate synthase beta subunit or the tryptophan 
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synthase alpha subunit that has been treated with a test compound to the activity of an 
untreated anthranilate synthase alpha subunit, anthranilate synthase beta subunit or 
tryptophan synthase alpha subunit, thereby selecting compounds with potential for inhibitory 
activity. 

5 BRIEF DESCRIPTION OF THE 

DRAWTNGS AND SEQUENCE DESCRIPTIONS 
The invention can be more fully understood from the following detailed description 
and the accompanying drawings and Sequence Listing which form a part of this application. 
Figure I depicts biochemical pathway for the production of tryptophan from 
10 chorismate. 

Figure 2 shows an alignment of the amino acid sequences from Rtdta graveolens 
anthranilate synthase alpha subunit (SEQ ID NO:39) and the instant com anthranilate 
synthase alpha subunit (contig of cdelc.pk004.g2, p0125.czabf83r, p0005xbmew77r, 
p0096.cnamt32r, p0032.crcas26r, p003 1 .ccmar07r, p0015.cdpfkl2r, p0128.cpibl07r and 

15 ctaln.pk0051.g3; SEQ ID N0:2). Amino acid which are identical among both sequences are 
indicated with an asterisk (*) above the alignment. Dashes are used by the program to 
maximize alignment of the sequences. 

Figure 3 shows an aligrunent of the amino acid sequences from Arabidopsis thaliana 
anthranilate synthase beta subunit (SEQ ID NO:40), the instant com anthranilate synthase 

20 beta subunit (contig of p0126.cnlcu75r, cen3n.pk0212.h6, ccoln.pk0038.bl, 

csiln.pk0017.d4, p0031.ccmai08r and ctnlc.pkOOl.llO; SEQ ID NO:20) and the instant rice 
anthranilate synthase beta subunit (contig of rds2c.pk004.i9, rls24.pk0002.e2 and 
rlOn.pkl 13.b4; SEQ ID NO:22). Amino acid which are identical among all sequences are 
indicated with an asterisk (*) above the alignment. Dashes are used by the program to 

25 maximize alignment of the sequences. 

Figure 4 shows an alignment of the amino acid sequences from Arabidopsis thaliana 
tryptophan synthase alpha subunit (SEQ ID N0:41), the instant com tryptophan synthase 
alpha subunit (contig of chp2.pk0020.e2 and lkr.pk0013.gl; SEQ ID NO:26), the instant 
com tryptophan synthase alpha subunit (crln.pk0052.b8; SEQ ID NO:28) and the instant 

30 rice tryptophan synthase alpha subunit (rrl.pk0038.h6; SEQ ID NO:32). Amino acid which 
are identical among all sequences are indicated with an asterisk (*) above the aligrunent. 
Dashes are used by the program to maximize alignment of the sequences. 

The following sequence descriptions and Sequence Listing attached hereto comply 
with the mles goveming nucleotide and/or amino acid sequence disclosures in patent 

35 applications as set forth in 37 C.F.R. §1.821-1.825. 

SEQ ID N0:1 is the nucleotide sequence comprising the contig assembled from a 
portion of the cDNA inserts in clones cdelc.pk004.g2, pOI25.czabf83r, p0005.cbmew77r, 
p0096.cnamt32r, p0032.crcas26r, p003 1 .ccmar07r, p0015.cdpfkl2r and p0128.cpibl07r and 
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the entire cDNA insert in clone ctaln.pkOOS 1 .g3 encoding an entire com anthranilate 
synthase alpha subunit. 

SEQ ID N0:2 is the deduced amino acid sequence of an entire com anthranilate 
synthase alpha subunit derived from the nucleotide sequence of SEQ ID N0:1. 
5 SEQ ID N0:3 is the nucleotide sequence comprising the contig assembled from a 

portion of the cDNA inserts in clones pOl 15.clsmj70r, cbn2.pk0049.d2, cpllc.pk008.m4 and 
p0102.ceraq01r and the entire cDNA insert in clone ceb3.pk001 1x12 encoding a portion of 
a com anthranilate synthase alpha subunit. 

SEQ ID N0:4 is the deduced amino acid sequence of a portion of a com anthranilate 
10 synthase alpha subunit derived from the nucleotide sequence of SEQ ID N0:3. 

SEQ ID N0:5 is the nucleotide sequence comprising a portion of the cDNA insert in 
clone rcaln.pk004.p22 encoding a portion of a rice anthranilate synthase alpha subunit. 

SEQ ID N0:6 is the deduced amino acid sequence of a portion of a rice anthranilate 
synthase alpha subunit derived from the nucleotide sequence of SEQ ID N0:5. 
15 SEQ ID N0:7 is the nucleotide sequence comprising a portion of the cDNA insert in 

clone sdp3c.pk019.c6 encoding a portion of a soybean anthranilate synthase alpha subunit. 

SEQ ID N0:8 is the deduced amino acid sequence of a portion of a soybean 
anthranilate synthase alpha subunit derived from the nucleotide sequence of SEQ ID N0:7. 
SEQ ID N0:9 is the nucleotide sequence comprising a portion of the cDNA insert in 
20 clone sfll ,pkl 29.dl 0 encoding a portion of a soybean anthranilate synthase alpha subunit. 
SEQ ID NO: 10 is the deduced amino acid sequence of a portion of a soybean 
anthranilate synthase alpha subunit derived from the nucleotide sequence of SEQ ID N0:9. 

SEQ ID NO: 1 1 is the nucleotide sequence comprising a portion of the 5' terminus of 
the cDNA insert in clone wleln.pk0075.b4 encoding a portion of a wheat anthranilate 
25 synthase alpha subunit. 

SEQ ID NO: 12 is the deduced amino acid sequence of a portion of a wheat 
anthranilate synthase alpha subunit derived from the nucleotide sequence of SEQ ID NO: 1 1. 

SEQ ID NO: 13 is the nucleotide sequence comprising a portion of the 3' terminus of 
the cDNA insert in clone wle 1 n.pk0075.b4 encoding a portion of a wheat anthranilate 
30 synthase alpha subunit. 

SEQ ID NO: 14 is the deduced amino acid sequence of a portion of a wheat 
anthranilate synthase alpha subunit derived from the nucleotide sequence of SEQ ID N0:13. 

SEQ ID NO: 15 is the nucleotide sequence comprising a portion of the cDNA insert in 
clone wlm96.pk026.j 1 encoding a portion of a wheat anthranilate synthase alpha subunit, 
35 SEQ ID NO; 1 6 is the deduced amino acid sequence of a portion of a wheat 

anthranilate synthase alpha subunit derived from the nucleotide sequence of SEQ ID NO: 15. 

SEQ ID NO: 17 is the nucleotide sequence comprising the contig assembled from a 
portion of the cDNA insert in clones cen3n.pk0210.c3, cepe7.pk0012.g2 and 
cen3n.pk0047.g5 encoding a portion of a com anthranilate synthase beta subunit. 
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SEQ ID NO: 1 8 is the deduced amino acid sequence of a portion of a com anthranilate 
synthase beta subunit derived from the nucleotide sequence of SEQ ID NO: 17. 

SEQ ID NO: 19 is the nucleotide sequence comprising the contig assembled from a 
portion of the cDNA inserts in clones p0126,cnlcu75r, cen3n.pk0212.h6, ccoln.pk0038.bl, 
5 csiln.pk0017.d4, p0031xcmai08r and ctnlc.pkOOl.llO encoding an entire com anthranilate 
synthase beta subunit. 

SEQ ID NO:20 is the deduced amino acid sequence of an entire com anthranilate 
synthase beta subunit derived from the nucleotide sequence of SEQ ID NO: 19. 

SEQ ID N0:2I is the nucleotide sequence comprising the contig assembled from a 
10 portion of the cDNA inserts in clones rds2c.pk004.i9, ris24.pk0002.e2 and rlOn.pkl 13.b4 
encoding a a substantial portion of a rice anthranilate synthase beta subunit. 

SEQ ID NO:22 is the deduced amino acid sequence of a substantial portion of a rice 
anthranilate synthase beta subunit derived from the nucleotide sequence of SEQ ID N0:2I. 
SEQ ID NO:23 is the nucleotide sequence comprising a portion of the cDNA insert in 
15 clone wlm96.pk045.cl 1 encoding a portion of a wheat anthranilate synthase beta subunit. 
SEQ ID NO:24 is the deduced amino acid sequence of a portion of a v^heat 
anthranilate synthase beta subunit derived from the nucleotide sequence of SEQ ID NO:23. 

SEQ ID NO:25 is the nucleotide sequence comprising the contig assembled from a 
portion of the cDNA inserts in clones chp2.pk0020.e2 and lkr.pk0013.gl encoding an entire 
20 com tryptophan synthase alpha subunit. 

SEQ ID NO:26 is the deduced amino acid sequence of an entire com tryptophan 
synthase alpha subunit derived from the nucleotide sequence of SEQ ID NO:25. 

SEQ ID NO:27 is the nucleotide sequence comprising a portion of the cDNA insert in 
clone crln.pk0052.b8 encoding an almost entire com tryptophan synthase alpha subunit. 
25 SEQ ID NO:28 is the deduced amino acid sequence of an almost entire com 

tryptophan synthase alpha subunit derived from the nucleotide sequence of SEQ ID NO:27. 

SEQ ID NO:29 is the nucleotide sequence comprising the contig assembled from a 
portion of the cDNA inserts in clones reslc.pk004.c4 and rrl.pk0071 .f2 encoding a portion 
of a rice tryptophan synthase alpha subunit. 
30 SEQ ID NO:30 is the deduced amino acid sequence of a portion of a rice tryptophan 

synthase alpha subunit derived from the nucleotide sequence of SEQ ID NO:29. 

SEQ ID N0:3 1 is the nucleotide sequence comprising a portion of the cDNA insert in 
clone nl .pkO038.h6 encoding an entire rice tryptophan synthase alpha subunit. 

SEQ ID NO:32 is the deduced amino acid sequence of an entire rice tryptophan 
35 synthase alpha subunit derived from the nucleotide sequence of SEQ ID N0:3 1 . 

SEQ ID NO:33 is the nucleotide sequence comprising the contig assembled from a 
portion of the cDNA inserts in clones src3c.pk004.cl9and sls2c.pk025.dl 1 encoding a 
portion of a soybean tryptophan synthase alpha subunit. 
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SEQ ID NO:34 is the deduced amino acid sequence of a portion of a soybean 
tryptophan synthase alpha subunit derived from the nucleotide sequence of SEQ ID NO:33. 

SEQ ID NO:35 is the nucleotide sequence comprising the contig assembled from a 
portion of the cDNA inserts in clones wlm0.pk0028.f4 and wlm96.pk041.h3 encoding a 
5 portion of a wheat tryptophan synthase alpha subunit. 

SEQ ID NO:36 is the deduced amino acid sequence of a portion of a wheat tryptophan 
synthase alpha subunit derived from the nucleotide sequence of SEQ ID NO:35. 

SEQ ID NO:37 is the nucleotide sequence comprising the contig assembled from a 
portion of the cDNA inserts in clones wlln.pk0109.d5 and wlln.pkOl 10x1 encoding a 
10 portion of a wheat tryptophan synthase alpha subunit 

SEQ ID NO:38 is the deduced amino acid sequence of a portion of a wheat tryptophan 
synthase alpha subunit derived from the nucleotide sequence of SEQ ID NO:37. 

SEQ ID NO:39 is the amino acid sequence of a Ruta graveolens anthranilate synthase 
alpha subunit having an NCBI General Identifier No. 960289. 
15 SEQ ID NO:40 is the amino acid sequence ofaArabidopsis thaliana anthranilate 

synthase beta subunit having an NCBI General Identifier No. 541849. 

SEQ ID N0:41 is the amino acid sequence of a Arabidopsis thaliana tryptophan 
synthase alpha subunit having an NCBI General Identifier No. 2129755. 

The Sequence Listing contains the one letter code for nucleotide sequence characters 
20 and the three letter codes for amino acids as defined in conformity with the lUPAC-IUBMB 
standards described in Nucleic Acids Research 7 J:3021-3030 (1985) and in the Biochemical 
Journal 219 (No. 2;:345-373 (1984) which are herein incorporated by reference. The 
symbols and format used for nucleotide and amino acid sequence data comply with the rules 
setforthin37C.F.R. §1.822. 
25 DETAILED DESCRIPTION OF THE INVENTION 

In the context of this disclosure, a number of terms shall be utilized. As used herein, 
an "isolated nucleic acid fragment" is a polymer of RNA or DNA that is single- or double- 
stranded, optionally containing synthetic, non-natural or altered nucleotide bases. An 
isolated nucleic acid fragment in the form of a polymer of DNA may be comprised of one or 
30 more segments of cDNA, genomic DNA or synthetic DNA. As used herein, "contig" refers 
to an assemblage of overlapping nucleic acid sequences to form one contiguous nucleotide 
sequence. For example, several DNA sequences can be compared and aligned to identify 
common or overlapping regions. The individual sequences can then be assembled into a 
single contiguous nucleotide sequence. 
35 As used herein, "substantially similar" refers to nucleic acid fragments wherein 

changes in one or more nucleotide bases results in substitution of one or more amino acids, 
but do not affect the fiinctional properties of the protein encoded by the DNA sequence, 
"Substantially similar" also refers to nucleic acid fragments wherein changes in one or more 
nucleotide bases does not affect the ability of the nucleic acid fragment to mediate alteration 
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of gene expression by antisense or co-suppression technology, "Substantially similar" also 
refers to modifications of the nucleic acid fragments of the instant invention such as deletion 
or insertion of one or more nucleotides that do not substantially affect the functional 
properties of the resulting transcript vis-a-vis the ability to mediate alteration of gene 
5 expression by antisense or co-suppression technology or alteration of the functional 

properties of the resulting protein molecule. It is therefore understood that the invention 
encompasses more than the specific exemplary sequences. 

For example, it is well known in the art that antisense suppression and co-suppression 
of gene expression may be accomplished using nucleic acid fragments representing less than 

10 the entire coding region of a gene, and by nucleic acid fragments that do not share 100% 
sequence identity with the gene to be suppressed. Moreover, alterations in a gene which 
result in the production of a chemically equivalent amino acid at a given site, but do not 
effect the functional properties of the encoded protein, are well knovm in the art. Thus, a 
codon for the amino acid alanine, a hydrophobic amino acid, may be substituted by a codon 

15 encoding another less hydrophobic residue, such as glycine, or a more hydrophobic residue, 
such as valine, leucine, or isoleucine. Similarly, changes which result in substitution of one 
negatively charged residue for another, such as aspartic acid for glutamic acid, or one 
positively charged residue for another, such as lysine for arginine, can also be expected to 
produce a functionally equivalent product. Nucleotide changes which resuh in alteration of 

20 the N-terminal and C-terminal portions of the protein molecule would also not be expected 
to alter the activity of the protein. Each of the proposed modifications is well within the 
routine skill in the art, as is determination of retention of biological activity of the encoded 
products. Moreover, the skilled artisan recognizes that substantially similar nucleic acid 
sequences encompassed by this invention are also defined by their ability to hybridize, under 

25 stringent conditions (O.IX SSC, 0.1% SDS, 65°C), with the sequences exemplified herein. 
Substantially similar nucleic acid fragments of the instant invention may also be 
characterized by the percent identity of the amino acid sequences that they encode to the 
amino acid sequences disclosed herein, as determined by algorithms commonly employed by 
those skilled in this art. Preferred are those nucleic acid fragments whose nucleotide 

30 sequences encode amino acid sequences that are greater than 80% identical to the amino acid 
sequences reported herein. More preferred nucleic acid fragments encode amino acid 
sequences that are 90% identical to the amino acid sequences reported herein. Most 
preferred are nucleic acid firagments that encode amino acid sequences that are 95% identical 
to the amino acid sequences reported herein. Sequence alignments and percent similarity 

35 calculations were performed using the Megalign program of the LASARGENE 

bioinformatics computing suite (DNASTAR Inc., Madison, WI). Multiple alignment of the 
sequences was performed using the Clustal method of alignment (Higgins, D.G. and Sharp, 
P.M. (1989) CABIOS. 5:151-153) with the default parameters (GAP PENALTY=10, GAP 
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LENGTH PEN ALT Y= 10). Default parameters for pairwise alignments using the Clustal 
method were KTUPLE 1, GAP PENALTY=3, WIND0W=5 and DIAGONALS SAVED=5. 

A "substantial portion" of an amino acid or nucleotide sequence comprises enough of 
the amino acid sequence of a polypeptide or the nucleotide sequence of a gene to afford 
5 putative identification of that polypeptide or gene, either by manual evaluation of the 
sequence by one skilled in the art, or by computer-automated sequence comparison and 
identification using algorithms such as BLAST (Basic Local Alignment Search Tool; 
Altschul, S. F., et al., (1993) J. Mol Biol 2/5:403-410; see also 

www.ncbi.nlm.nih.gov/BLAST/). In general, a sequence of ten or more contiguous amino 
10 acids or thirty or more nucleotides is necessary in order to putatively identify a polypeptide 
or nucleic acid sequence as homologous to a known protein or gene. Moreover, with respect 
to nucleotide sequences, gene specific oligonucleotide probes comprising 20-30 contiguous 
nucleotides may be used in sequence-dependent methods of gene identification (e.g.. 
Southern hybridization) and isolation (e.g., in situ hybridization of bacterial colonies or 
15 bacteriophage plaques). In addition, short oligonucleotides of 12-15 bases may be used as 
amplification primers in PGR in order to obtain a particular nucleic acid fragment 
comprising the primers. Accordingly, a "substantial portion" of a nucleotide sequence 
comprises enough of the sequence to afford specific identification and/or isolation of a 
nucleic acid fi-agment comprising the sequence. The instant specification teaches partial or 
20 complete amino acid and nucleotide sequences encoding one or more particular plant 

proteins. The skilled artisan, having the benefit of the sequences as reported herein, may 
now use all or a substantial portion of the disclosed sequences for purposes known to those 
skilled in this art. Accordingly, the instant invention comprises the complete sequences as 
reported in the accompanying Sequence Listing, as well as substantial portions of those 
25 sequences as defined above. 

"Codon degeneracy" refers to divergence in the genetic code permitting variation of 
the nucleotide sequence without effecting the amino acid sequence of an encoded 
polypeptide. Accordingly, the instant invention relates to any nucleic acid fragment that 
encodes all or a substantial portion of the amino acid sequence encoding the anthranilate 
30 synthase alpha subunit, the anthranilate synthase beta subunit or the tryptophan synthase 
alpha subunit proteins as set forth in SEQ ID N0s:2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 
26, 28, 30, 32, 34, 36 and 38. The skilled artisan is well aware of the "codon-bias" exhibited 
by a specific host cell in usage of nucleotide codons to specify a given amino acid. 
Therefore, when synthesizing a gene for improved expression in a host cell, it is desirable to 
35 design the gene such that its firequency of codon usage approaches the frequency of preferred 
codon usage of the host cell. 

"Synthetic genes" can be assembled from oligonucleotide building blocks that are 
chemically synthesized using procedures known to those skilled in the art. These building 
blocks are ligated and annealed to form gene segments which are then enzymatically 
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assembled to construct the entire gene. "Chemically synthesized", as related to a sequence 
of DNA, means that the component nucleotides were assembled in vitro. Manual chemical 
synthesis of DNA may be accomplished using well established procedures, or automated 
chemical synthesis can be performed using one of a number of commercially available 

5 machines. Accordingly, the genes can be tailored for optimal gene expression based on 
optimization of nucleotide sequence to reflect the codon bias of the host cell The skilled 
artisan appreciates the likelihood of successful gene expression if codon usage is biased 
towards those codons favored by the host. Determination of preferred codons can be based 
on a survey of genes derived from the host cell where sequence infonmation is available. 

10 "Gene" refers to a nucleic acid fragment that expresses a specific protein, including 

regulatory sequences preceding (5' non-coding sequences) and following (3* non-coding 
sequences) the coding sequence. "Native gene" refers to a gene as found in nature with its 
own regulatory sequences. "Chimeric gene" refers any gene that is not a native gene, 
comprising regulatory and coding sequences that are not found together in nature. 

15 Accordingly, a chimeric gene may comprise regulatory sequences and coding sequences that 
are derived from different sources, or regulatory sequences and coding sequences derived 
from the same source, but arranged in a manner different than that found in nature. 
"Endogenous gene" refers to a native gene in its natural location in the genome of an 
organism. A "foreign" gene refers to a gene not normally found in the host organism, but 

20 that is introduced into the host organism by gene transfer. Foreign genes can comprise 

native genes inserted into a non-native organism, or chimeric genes. A "transgene" is a gene 
that has been introduced into the genome by a transformation procedure. 

"Coding sequence" refers to a DNA sequence that codes for a specific amino acid 
sequence. "Regulatory sequences" refer to nucleotide sequences located upstream (5' non- 
25 coding sequences), within, or dovmstream (3' non-coding sequences) of a coding sequence, 
and which influence the transcription, RNA processing or stability, or translation of the 
associated coding sequence. Regulatory sequences may include promoters, translation 
leader sequences, introns, and polyadenylation recognition sequences. 

"Promoter" refers to a DNA sequence capable of controlling the expression of a 

30 coding sequence or functional RNA. In general, a coding sequence is located 3' to a 

promoter sequence. The promoter sequence consists of proximal and more distal upstream 
elements, the latter elements often referred to as enhancers. Accordingly, an "enhancer" is a 
DNA sequence which can stimulate promoter activity and may be an innate element of the 
promoter or a heterologous element inserted to enhance the level or tissue-specificity of a 

35 promoter. Promoters may be derived in their entirety from a native gene, or be composed of 
different elements derived from different promoters found in nature, or even comprise 
synthetic DNA segments. It is understood by those skilled in the art that different promoters 
may direct the expression of a gene in different tissues or cell types, or at different stages of 
development, or in response to different environmental conditions. Promoters which cause a 
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gene to be expressed in most cell types at most times are commonly referred to as 
"constitutive promoters". New promoters of various types useful in plant cells are 
constantly being discovered; numerous examples may be found in the compilation by 
Okamuro and Goldberg, (1989) Biochemistry of Plants 75:1-82. It is further recognized that 
5 since in most cases the exact boundaries of regulatory sequences have not been completely 
defined, DNA fragments of different lengths may have identical promoter activity. 

The "translation leader sequence" refers to a DNA sequence located between the 
promoter sequence of a gene and the coding sequence. The translation leader sequence is 
present in the fully processed mRNA upstream of the translation start sequence. The 
10 translation leader sequence may affect processing of the primary transcript to mRNA, 
mRNA stability or translation efficiency. Examples of translation leader sequences have 
been described (Turner, R. and Foster, G. D. (1995) Molecular Biotechnology i:225). 

The "3' non-coding sequences" refer to DNA sequences located downstream of a 
coding sequence and include polyadenylation recognition sequences and other sequences 
15 encoding regulatory signals capable of affecting mRNA processing or gene expression. The 
polyadenylation signal is usually characterized by affecting the addition of polyadenylic acid 
tracts to the 3* end of the mRNA precursor. The use of different 3' non-coding sequences is 
exemplified by Ingelbrecht et al., (1989) Plant Cell 7:671-680. 

"RNA transcript" refers to the product resulting from RNA polymerase-catalyzed 
20 transcription of a DNA sequence. When the RNA transcript is a perfect complementary 
copy of the DNA sequence, it is referred to as the primary transcript or it may be a RNA 
sequence derived firom posttranscriptional processing of the primary transcript and is 
referred to as the mature RNA. "Messenger RNA (mRNA)" refers to the RNA that is 
without introns and that can be translated into protein by the cell. "cDNA" refers to a 
25 double-stranded DNA that is complementary to and derived from mRNA. "Sense" RNA 
refers to RNA transcript that includes the mRNA and so can be translated into protein by the 
cell. "Antisense RNA" refers to a RNA transcript that is complementary to all or part of a 
target primary transcript or mRNA and that blocks the expression of a target gene (U.S. Pat. 
No. 5,107,065, incorporated herein by reference). The complementarity of an antisense 
30 RNA may be with any part of the specific gene transcript, i.e., at the 5* non-coding sequence, 
3' non-coding sequence, introns, or the coding sequence. "Functional RNA" refers to sense 
RNA, antisense RNA, ribozyme RNA, or other RNA that may not be translated but yet has 
an effect on cellular processes. 

The term "operably linked" refers to the association of nucleic acid sequences on a 
35 single nucleic acid fragment so that the flmction of one is affected by the other. For 
example, a promoter is operably linked with a coding sequence when it is capable of 
affecting the expression of that coding sequence (i.e., that the coding sequence is under the 
transcriptional control of the promoter). Coding sequences can be operably linked to 
regulatory sequences in sense or antisense orientation. 
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The term ''expression", as used herein, refers to the transcription and stable 
accumulation of sense (mRNA) or antisense RNA derived from the nucleic acid fragment of 
the invention. Expression may also refer to translation of mRNA into a polypeptide. 
"Antisense inhibition" refers to the production of antisense RNA transcripts capable of 
5 suppressing the expression of the target protein. "Overexpression" refers to the production 
of a gene product in transgenic organisms that exceeds levels of production in normal or 
non-transformed organisms. "Co-suppression" refers to the production of sense RNA 
transcripts capable of suppressing the expression of identical or substantially similar foreign 
or endogenous genes (U.S. Pat. No. 5,231,020, incorporated herein by reference). 

10 "Altered levels" refers to the production of gene product(s) in transgenic organisms in 

amounts or proportions that differ from that of normal or non-transformed organisms. 

"Mature" protein refers to a post-translationally processed polypeptide; i.e., one from 
which any pre- or propeptides present in the primary translation product have been removed. 
"Precursor" protein refers to the primary product of translation of mRNA; i.e., with pre- and 

15 propeptides still present. Pre- and propeptides may be but are not limited to intracellular 
localization signals. 

A "chloroplast transit peptide" is an amino acid sequence which is translated in 
conjunction with a protein and directs the protein to the chloroplast or other plastid types 
present in the cell in which the protein is made. "Chloroplast transit sequence" refers to a 

20 nucleotide sequence that encodes a chloroplast transit peptide. A "signal peptide" is an 

amino acid sequence which is translated in conjunction \yith a protein and directs the protein 
to the secretory system (Chrispeels, J. J., (1991) Ann. Rev. Plant Phys. Plant Mol Biol 
^2:21-53). If the protein is to be directed to a vacuole, a vacuolar targeting signal {supra) 
can fiirther be added, or if to the endoplasmic reticulum, an endoplasmic reticulum retention 

25 signal {supra) may be added. If the protein is to be directed to the nucleus, any signal 
peptide present should be removed and instead a nuclear localization signal included 
(Raikhel (1992) Plant Phys. J 00:1627 -1632), 

"Transformation" refers to the transfer of a nucleic acid fragment into the genome of a 
host organism, resulting in genetically stable inheritance. Host organisms containing the 

30 transformed nucleic acid fragments are referred to as "transgenic" organisms. Examples of 
methods of plant transformation include Agrobacterium-mediated transformation (De Blaere 
et al. (1987) Meth. Enzymol 143:211) and particle-accelerated or "gene gun" transformation 
technology (Klein et al. (1987) Nature (London) 327:10-13; U.S. Pat. No, 4,945,050, 
incorporated herein by reference). 

35 Standard recombinant DNA and molecular cloning techniques used herein are well 

known in the art and are described more fully in Sambrook, J., Fritsch, E.F. and Maniatis, T. 
Molecular Cloning: A Laboratory Manual] Cold Spring Harbor Laboratory Press: Cold 
Spring Harbor, 1989 (hereinafter "Maniatis"). 
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Nucleic acid fragments encoding at least a portion of several tryptophan biosynthetic 
enzymes have been isolated and identified by comparison of random plant cDNA sequences 
to public databases containing nucleotide and protein sequences using the BLAST 
algorithms well known to those skilled in the art. Table 1 lists the proteins that are described 
herein, and the designation of the cDNA clones that comprise the nucleic acid fragments 
encoding these proteins. 



TABLE 1 
Tryptophan Biosynthetic Enzymes 



Enzyme 



Clone 



Plant 



Anthranilate Synthase Alpha Subunit 



Anthranilate Synthase Beta Subunit 



Contig of: Com 
cdelc.pk004.g2 
p0125.czabf83r 
p0005.cbmew77r 
p0096.cnamt32r 
p0032.crcas26r 
pO031.ccmarO7r 
p0015.cdpfkl2r 
p0128.cpibl07r 
ctaln.pk0051.g3 

Contig of: Com 
p0115.clsmj70r 
cbn2.pk0049.d2 
cpllc.pk008.m4 
p0102.ceraq0lr 
ceb3.pk001LcI2 

rcaln.pk004.p22 Rice 

sdp3c.pk019.c6 Soybean 

sfll.pkl29.dl0 Soybean 

wleln.pk0075.b4 Wheat 

wlm96.pk026.jl Wheat 

Contig of: Com 
cen3n.pk0210.c3 
cepe7.pk0012.g2 
cen3n.pk0047.g5 

Contig of: Com 
p0126.cnlcu75r 
cen3n.pk0212.h6 
ccoln.pk0038.bl 
csiln.pk0017.d4 
p0031.ccmai08r 
ctnlc.pkOOl.llO 

Contig of: Rice 
rds2c.pk004.i9 
rls24.pk0002.e2 
rl0n.pkll3.b4 

wlm96.pk045.cll Wheat 
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Enzyme Clone Plant 



1 ryptopnan byntnase Aipna MiDunu 


Prmtia nf* 

chp2.pk0020.e2 
lkr.pk0013.gl 


Com 




crln Dk0052 b8 


Com 




Contigof: 

reslc.pk004.c4 

rrl nk-nOTl D 
m .pKuv / 1 .1^ 


Rice 




rrl.pk0038.h6 


Rice 




Contig of: 

src3c.pkO04xl9 
sls2c.pk025.dll 


Soybean 




Contig of: 

wlm0.pk0028.f4 
wlm96.pk041.h3 


Wheat 




Contig of: 

wlln.pk0109.d5 
wUn.pkOllO.cl 


Wheat 



The nucleic acid fragments of the instant invention may be used to isolate cDNAs and 
genes encoding homologous proteins from the same or other plant species. Isolation of 
5 homologous genes using sequence-dependent protocols is well known in the art. Examples 
of sequence-dependent protocols include, but are not limited to, methods of nucleic acid 
hybridization, and methods of DNA and RNA amplification as exemplified by various uses 
of nucleic acid amplification technologies (e.g., polymerase chain reaction, ligase chain 
reaction). 

10 For example, genes encoding other anthranilate synthase alpha subunit, anthranilate 

synthase beta subunit or tryptophan synthase alpha subunit, either as cDNAs or genomic 
DNAs, could be isolated directly by using all or a portion of the instant nucleic acid 
fragments as DNA hybridization probes to screen libraries from any desired plant employing 
methodology well known to those skilled in the art. Specific oligonucleotide probes based 

15 upon the instant nucleic acid sequences can be designed and synthesized by methods known 
in the art (Maniatis). Moreover, the entire sequences can be used directly to synthesize DNA 
probes by methods known to the skilled artisan such as random primer DNA labeling, nick 
translation, or end-labeling techniques, or RNA probes using available in vitro transcription 
systems. In addition, specific primers can be designed and used to amplify a part or all of 

20 the instant sequences. The resulting amplification products can be labeled directly during 
amplification reactions or labeled after amplification reactions, and used as probes to isolate 
fiill length cDNA or genomic fragments under conditions of appropriate stringency. 

In addition, two short segments of the instant nucleic acid fragments may be used in 
polymerase chain reaction protocols to amplify longer nucleic acid fragments encoding 
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homologous genes from DNA or RNA. The polymerase chain reaction may also be 
performed on a library of cloned nucleic acid fragments wherein the sequence of one primer 
is derived from the instant nucleic acid fragments, and the sequence of the other primer takes 
advantage of the presence of the polyadenylic acid tracts to the 3' end of the mRNA 
5 precursor encoding plant genes. Alternatively, the second primer sequence may be based 
upon sequences derived from the cloning vector. For example, the skilled artisan can follow 
the RACE protocol (Frohman et al„ (1988) PNAS USA 55:8998) to generate cDNAs by 
using PGR to amplify copies of the region between a single point in the transcript and the 3* 
or 5' end. Primers oriented in the 3' and 5* directions can be designed from the instant 
10 sequences. Using commercially available 3* RACE or 5' RACE systems (BRL), specific 3' 
or 5' cDNA fragments can be isolated (Ohara et al., (1989) PNAS USA 55:5673; Loh et al., 
(1989) Science 243:217). Products generated by the 3' and 5' RACE procedures can be 
combined to generate ftill-length cDNAs (Frohman, M. A. and Martin, G. R., (1989) 
Techniques l:\65). 

15 Availability of the instant nucleotide and deduced amino acid sequences facilitates 

immunological screening of cDNA expression libraries. Synthetic peptides representing 
portions of the instant amino acid sequences may be synthesized. These peptides can be 
used to immunize animals to produce polyclonal or monoclonal antibodies with specificity 
for peptides or proteins comprising the amino acid sequences. These antibodies can be then 

20 be used to screen cDNA expression libraries to isolate fiill-length cDNA clones of interest 
(Lemer, R. A. (1984) ^c/v. Immunol, 56:1; Maniatis). 

The nucleic acid fragments of the instant invention may be used to create transgenic 
plants in which the disclosed anthranilate synthase alpha subunit, anthranilate synthase beta 
subunit or tryptophan synthase alpha subunit are present at higher or lower levels than 

25 normal or in cell types or developmental stages in which they are not normally found. This 
would have the effect of altering the level of tryptophan in those cells. Manipulation of the 
levels of some of the antranilate synthase alpha subunits will also result in changes in the 
response to pathogen attack. Because this pathway is not followed for the production of 
tryptophan in higher animals, these enzymes are very good candidates for the discovery of 

30 herbicides and ftmgicides. 

Overexpression of the anthranilate synthase alpha subunit, the anthranilate synthase 
beta subunit or the tryptophan synthase alpha subunit proteins of the instant invention may 
be accomplished by first constructing a chimeric gene in which the coding region is operably 
linked to a promoter capable of directing expression of a gene in the desired tissues at the 

35 desired stage of development. For reasons of convenience, the chimeric gene may comprise 
promoter sequences and translation leader sequences derived from the same genes. 3' Non- 
coding sequences encoding transcription termination signals may also be provided. The 
instant chimeric gene may also comprise one or more introns in order to facilitate gene 
expression. 
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Plasmid vectors comprising the instant chimeric gene can then constructed. The 
choice of plasmid vector is dependent upon the method that will be used to transform host 
plants. The skilled artisan is well aware of the genetic elements that must be present on the 
plasmid vector in order to successfully transform, select and propagate host cells containing 
5 the chimeric gene. The skilled artisan will also recognize that different independent 

transformation events will result in different levels and patterns of expression (Jones et al., 
(1985) EMBOl 4:241 1-2418; De Almeida et ah, (1989) MoL Gen. Genetics 275:78-86), 
and thus that multiple events must be screened in order to obtain lines displaying the desired 
expression level and pattern. Such screening may be accomplished by Southem analysis of 
10 DNA, Northern analysis of mRNA expression. Western analysis of protein expression, or 
phenotypic analysis. 

For some applications it may be useful to direct the instant tryptophan biosynthetic 
enzyme to different cellular compartments, or to facilitate its secretion from the cell. It is 
thus envisioned that the chimeric gene described above may be further supplemented by 

15 altering the coding sequence to encode anthranilate synthase alpha subunit, anthranilate 
synthase beta subunit or tryptophan synthase alpha subunit with appropriate intracellular 
targeting sequences such as transit sequences (Keegstra, K. (1989) Cell 56:247-253), signal 
sequences or sequences encoding endoplasmic reticulum localization (Chrispeels, J J., 
(1991) Ann. Rev, Plant Phys. Plant MoL Biol. 42:21-53), or nuclear localization signals 

20 (Raikhel, N. (1992) Plant Phys, 100:1627-1632) added and/or with targeting sequences that 
are already present removed. While the references cited give examples of each of these, the 
list is not exhaustive and more targeting signals of utility may be discovered in the future. 

It may also be desirable to reduce or eliminate expression of genes encoding 
anthranilate synthase alpha subunit, anthranilate synthase beta subunit or tryptophan 

25 synthase alpha subunit in plants for some applications. In order to accomplish this, a 

chimeric gene designed for co-suppression of the instant tryptophan biosynthetic enzyme 
can be constructed by linking a gene or gene fragment encoding an anthranilate synthase 
alpha subunit, anthranilate synthase beta subunit or tryptophan synthase alpha subunit to 
plant promoter sequences. Altematively, a chimeric gene designed to express antisense 

30 RNA for all or part of the instant nucleic acid fragment can be constructed by linking the 
gene or gene firagment in reverse orientation to plant promoter sequences. Either the co- 
suppression or antisense chimeric genes could be introduced into plants via transformation 
wherein expression of the corresponding endogenous genes are reduced or eliminated. 

The instant anthranilate synthase alpha subunit, anthranilate synthase beta subunit or 

35 tryptophan synthase alpha subunit (or portions thereof) may be produced in heterologous 
host cells, particularly in the cells of microbial hosts, and can be used to prepare antibodies 
to the these proteins by methods well known to those skilled in the art. The antibodies are 
useful for detecting anthranilate synthase alpha subunit, anthranilate synthase beta subunit or 
tryptophan synthase alpha subunit in situ in cells or in vitro in cell extracts. Preferred 
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heterologous host cells for production of the instant anthranilate synthase alpha subunit, 
anthranilate synthase beta subunit or tryptophan synthase alpha subunit are microbial hosts. 
Microbial expression systems and expression vectors containing regulatory sequences that 
direct high level expression of foreign proteins are well known to those skilled in the art. 

5 Any of these could be used to construct a chimeric gene for production of the instant 
anthranilate synthase alpha subunit, anthranilate synthase beta subunit or tryptophan 
synthase alpha subunit. This chimeric gene could then be introduced into appropriate 
microorganisms via transformation to provide high level expression of the encoded 
tryptophan biosynthetic enzyme. An example of a vector for high level expression of the 

10 instant anthranilate synthase alpha subunit, anthranilate synthase beta subunit or tryptophan 
synthase alpha subunit in a bacterial host is provided (Example 7). 

Additionally, the instant anthranilate synthase alpha subunit, anthranilate synthase beta 
subunit or tryptophan synthase alpha subunit can be used as targets to facilitate design 
and/or identification of inhibitors of those enzymes that may be useful as herbicides. This is 

15 desirable because the anthranilate synthase alpha subunit, the anthranilate synthase beta 

subunit and the tryptophan synthase alpha subunit described herein catalyze various steps in 
tryptophan biosynthesis. Accordingly, inhibition of the activity of one or more of the 
enzymes described herein could lead to inhibition plant grov^h. Thus, the instant 
anthranilate synthase alpha subunit, anthranilate synthase beta subunit, tryptophan synthase 

20 alpha subunit could be appropriate for new herbicide discovery and design. 

All or a substantial portion of the nucleic acid fragments of the instant invention may 
also be used as probes for genetically and physically mapping the genes that they are a part 
of, and as markers for traits linked to those genes. Such information may be useful in plant 
breeding in order to develop lines with desired phenotypes. For example, the instant nucleic 

25 acid fragments may be used as restriction fragment length polymorphism (RFLP) markers. 
Southern blots (Maniatis) of restriction-digested plant genomic DNA may be probed with 
the nucleic acid fragments of the instant invention. The resulting banding patterns may then 
be subjected to genetic analyses using computer programs such as MapMaker (Lander et at., 
(1987) Genomics 7:174-181) in order to construct a genetic map. In addition, the nucleic 

30 acid fragments of the instant invention may be used to probe Southern blots containing 
restriction endonuclease-treated genomic DNAs of a set of individuals representing parent 
and progeny of a defined genetic cross. Segregation of the DNA polymorphisms is noted 
and used to calculate the position of the instant nucleic acid sequence in the genetic map 
previously obtained using this population (Botstein, D. et al., (1980) Am, J. Hum. Genet. 

35 52:314-331). 

The production and use of plant gene-derived probes for use in genetic mapping is 
described in R. Bematzky, R. and Tanksley, S. D. (1986) Plant Mol. Biol. Reporter 
4(l):'ilA\. Numerous publications describe genetic mapping of specific cDNA clones 
using the methodology outlined above or variations thereof For example, F2 intercross 
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populations, backcross populations, randomly mated populations, near isogenic lines, and 
other sets of individuals may be used for mapping. Such methodologies are well known to 
those skilled in the art. 

Nucleic acid probes derived from the instant nucleic acid sequences may also be used 

5 for physical mapping (i.e., placement of sequences on physical maps; see Hoheisel, J. D., et 
al., In: Nonmammalian Genomic Analysis: A Practical Guide, Academic press 1996, 
pp. 3 19-346, and references cited therein). 

In another embodiment, nucleic acid probes derived from the instant nucleic acid 
sequences may be used in direct fluorescence in situ hybridization (FISH) mapping (Trask, 

10 B. J. (1991) Trends Genet. 7:149-154). Although current methods of FISH mapping favor 
use of large clones (several to several hundred KB; see Laan, M. et al. (1995) Genome 
Research 5:13-20), improvements in sensitivity may allow performance of FISH mapping 
using shorter probes. 

A variety of nucleic acid amplification-based methods of genetic and physical 

15 mapping may be carried out using the instant nucleic acid sequences. Examples include 
allele-specific amplification (Kazazian, H. H. (1989) 1 Lab, Clin. Med 114(2):95-96l 
polymorphism of PCR-amplified fragments (CAPS; Sheffield, V. C. et al. (1993) Genomics 
7^:325-332), allele-specific ligation (Landegren, U. et al, (1988) Science 2¥/: 1077-1080), 
nucleotide extension reactions (Sokolov, B. P. (1990) Nucleic Acid Res. 75:3671), Radiation 

20 Hybrid Mapping (Walter, M. A. et al. (1997) Nature Genetics 7:22-28) and Happy Mapping 
(Dear, P. H. and Cook, P. R. (1989) Nucleic Acid Res. 77:6795-6807). For these methods, 
the sequence of a nucleic acid fragment is used to design and produce primer pairs for use in 
the amplification reaction or in primer extension reactions. The design of such primers is 
well known to those skilled in the art. In methods employing PCR-based genetic mapping, 

25 it may be necessary to identify DNA sequence differences between the parents of the 
mapping cross in the region corresponding to the instant nucleic acid sequence. This, 
however, is generally not necessary for mapping methods. 

Loss of function mutant phenotypes may be identified for the instant cDNA clones 
either by targeted gene disruption protocols or by identifying specific mutants for these 

30 genes contained in a maize population carrying mutations in all possible genes (Ballinger 
and Benzer, (1989) Proc. Natl. Acad Sci USA 86:9402; Koes et al., (1995) Proc. Natl Acad 
Sci USA 92:8149; Bensen et al., (1995) Plant Cell 7:75). The latter approach may be 
accomplished in two ways. First, short segments of the instant nucleic acid fragments may 
be used in polymerase chain reaction protocols in conjunction with a mutation tag sequence 

35 primer on DNAs prepared from a population of plants in which Mutator transposons or some 
other mutation-causing DNA element has been introduced (see Bensen, supra). The 
amplification of a specific DNA fragment with these primers indicates the insertion of the 
mutation tag element in or near the plant gene encoding the anthranilate synthase alpha 
subimit, the anthranilate synthase beta subunit or the tryptophan synthase alpha subunit. 
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Alternatively, the instant nucleic acid fragment may be used as a hybridization probe against 
PGR amplification products generated from the mutation population using the mutation tag 
sequence primer in conjunction with an arbitrary genomic site primer, such as that for a 
restriction enzyme site-anchored synthetic adaptor. With either method, a plant containing a 

5 mutation in the endogenous gene encoding an anthranilate synthase alpha subunit, an 

anthranilate synthase beta subunit or a tryptophan synthase alpha subunit can be identified 
and obtained. This mutant plant can then be used to determine or confirm the natural 
ftmction of the anthranilate synthase alpha subunit, the anthranilate synthase beta subunit or 
the tryptophan synthase alpha subunit gene product. 

10 EXAMPLES 

The present invention is further defined in the following Examples, in which all parts 
and percentages are by weight and degrees are Celsius, unless otherwise stated. It should be 
understood that these Examples, while indicating preferred embodiments of the invention, 
are given by way of illustration only. From the above discussion and these Examples, one 

15 skilled in the art can ascertain the essential characteristics of this invention, and without 

departing from the spirit and scope thereof, can make various changes and modifications of 
the invention to adapt it to various usages and conditions. 

EXAMPLE 1 

Composition of cDNA Libraries: Isolation and Sequencing of cDNA Clones 
20 cDNA libraries representing mRNAs from various com, rice, soybean and wheat 

tissues were prepared. The characteristics of the libraries are described below. 



TABLE 2 

cDNA Libraries from Com, Rice, Soybean and Wheat 



Library 


Tissue 


Clone 


cbn2 


Com Developing Kernel Two Days After 
Pollination 


cbn2.pk0049.d2 


ceo In 


Com Cob of 67 Day Old Plants Grown in Green 
House* 


ccoln.pk0038.bl 


cdelc 


Com Developing Embryo 20 Days After 
Pollination 


cdelc.pk004.g2 


ceb3 


Com Embryo 20 Days After Pollination 


ceb3.pk001l.cl2 


cen3n 


Cora Endosperm 20 Days After Pollination* 


cen3n.pk0047.g5 




cen3n.pk0210.c3 
cen3n.pk0212.h6 


cepe7 


Com 7 Day Old Epicotyl From Etiolated Seedling 


cepe7.pk0012.g2 


chp2 


Com (B73 and MK593) 1 1 Day Old Leaf Treated 


chp2.pk0020.e2 


24 Hours With Herbicides* ♦ 




cpllc 


Com Pooled BMS Treated With Chemical 


cpllc.pk008.m4 


Chelators*** 




or In 


Com Root From 7 Day Old Seedlings* 
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Library Tissue 

csiln Com Silk* 
ctaln Com Tassel* 
ctn 1 c Com Tassel, Night Harvested 
Ikr Com 19 Day Old Seed 

p0005 Com Immature Ear 
pOO 1 5 Com Embryo 1 3 Days After Pollination 
p003 1 Com (CM45) Shoot Culture 

p0032 Com Regenerating Callus (Hi-II 223a and 1 129e), 
10 and 14 Days After Auxin Removal, Pooled 

p0096 Com Scutelum 2 and 3 Days After Germinating 
pO 1 02 Com Early Meiosis Tassels* 
pO 1 1 5 Com Leaf and Sheath Meristem Tissue Collected 
fi-omlOth,! 1th, and 12th Leaves, Pooled 

pO 1 25 Com Anther Prophase I 

p0126 Com Leaf Tissue (V8-V10****), Night-Harvested 

p0128 Com Primary and Secondary Immature Ear 

rcaln Rice Callus* 

rds2c Rice Developing Seeds 

res 1 c Rice Etiolated Seedling 

rlOn Rice 15Day OldLeaP 

rls24 Rice Leaf 1 5 Days After Germination, 24 Hours 
After Infection of Strain Magaporthe grisea 
4360-R-67 (AVR2-YAM0); Susceptible 

rrl Rice Root of Two Week Old Developing Seedling 

sdp3c Soybean Developing Pods (8-9 mm) 
sfl 1 Soybean Immature Flower 
sls2c Soybean Infected With Sclerotinia sclerotiorum 
mycelium 

src3c Soybean 8 Day Old Root Infected With Cyst 
Nematode 

wl 1 n Wheat Leaf From 7 Day Old Seedling* 

wle 1 n Wheat Leaf From 7 Day Old Etiolated Seedling* 

wImO Wheat Seedlings 0 Hour After Inoculation With 

Erysiphe graminis f. sp tritici 
wlm96 Wheat Seedlings 96 Hours After Inoculation With 

Erysiphe graminis / sp tritici 



Clone 

csiln.pk0017.d4 

ctaln.pk0051.g3 

ctnlc.pkOOl.llO 

lkr.pk0013.gl 

p0005.cbmew77r 

p0015.cdpfkl2r 

p0031xcmai08r 
p0031.ccmar07r 

p0032.crcas26r 

p0096.cnamt32r 

p0102.ceraq01r 

p0115.clsmj70r 

p0125.czabf83r 

p0126.cnlcu75r 

p0128.cpibl07r 

rcaln.pk004.p22 

rds2c.pk004,i9 

reslc.pk004.c4 

rl0n.pkll3.b4 

rls24.pk0002.e2 

rrl.pk0038.h6 
rrl.pk0071.f2 

sdp3c.pk019.c6 

sfll.pkl29.dl0 

sls2c.pk025.dll 

src3c.pk004.cl9 

wlln.pk0109.d5 
wlln.pk0110.cl 

wleln.pk0075.b4 

wlm0.pk0028.f4 

wlm96.pk026.jl 

wlm96.pk041.h3 

wlm96.pk045.cll 
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♦These libraries were normalized essentially as described in U.S. Pat. No. 5,482,845 
♦♦Application of 2-[(2,4-dihydro-2,6.9-trimethyl[llbenzothiopyrano[4,3-c]pyra2ol-8- 
yl)carbonyl]-l,3-cyclohexanedione 5,5-dioxide (synthesis and methods of using this 
compound are described in WO 97/19087, incorporated herein by reference) and 2- 
5 [(2 .3-dihydro-5 , 8-dimethylspiro[4//- 1 -benzothiopyran-4,2* -[ 1 ,3]dioxolan]-6- 

yl)carbonyl]-l,3-cyclohexanedione 5,5-dioxide (also named 2-[(2,3-dihydro-5,8- 
dimethyIspiro[4H-l-benzothiopyran-4,2*-[l,3]dioxolan]-6-yl)carbonyl]-3-hydroxy-2- 
cyclohexen-l-one 5,5-dioxide; synthesis and methods, of usmg diis compound are 
described in WO 97/01550, incorporated herein by reference). 
10 ♦ ♦♦Chemicals used included nitrilotriacetic acid, mercaptobenzodiiazole, diethyldithio- 
carbamate 

♦♦♦♦For description of com stages see Iowa State University Cooperative Extension Service 
Special Report No. 48. 

15 cDNA libraries were prepared in Uni-ZAP*^^ XR vectors according to the 

manufacturer's protocol (Stratagene Cloning Systems, La Jolla, CA). Conversion of the 
Uni-ZAP*^*^ XR libraries into plasmid libraries was accomplished according to the protocol 
provided by Stratagene. Upon conversion, cDNA inserts were contained in the plasmid 
vector pBluescript. cDNA inserts from randomly picked bacterial colonies containing 
20 recombinant pBluescript plasmids were amplified via polymerase chain reaction using 
primers specific for vector sequences flanking the inserted cDNA sequences or plasmid 
DNA was prepared from cultured bacterial cells. Amplified insert DNAs or plasmid DNAs 
were sequenced in dye-primer sequencing reactions to generate partial cDNA sequences 
(expressed sequence tags or "ESTs"; see Adams, M. D. et al., (\99\) Science 252:1651). 
25 The resulting ESTs were analyzed using a Perkin Elmer Model 377 fluorescent sequencer. 

EXAMPLE 2 
Identification of cDNA Clones 
ESTs encoding tryptophan biosynthetic enzymes were identified by conducting 
BLAST (Basic Local Alignment Search Tool; Altschul, S. F., et al., (1993) J. MoL Biol. 
30 2 ; 5:403-4 10; see also vww.ncbi.nlm.nih.gov/BLAST/) searches for similarity to sequences 
contained in the BLAST "nr" database (comprising all non-redundant GenBank CDS 
translations, sequences derived from the 3 -dimensional structure Brookhaven Protein Data 
Bank, the last major release of the SWISS-PROT protein sequence database, EMBL, and 
DDBJ databases). The cDNA sequences obtained in Example 1 were analyzed for similarity 
35 to all publicly available DNA sequences contained in the "nr" database using the BLASTN 
algorithm provided by the National Center for Biotechnology Information (NCBI). The 
DNA sequences were translated in all reading frames and compared for similarity to all 
publicly available protein sequences contained in the "nr" database using the BLASTX 
algorithm (Gish, W. and States, D. J. (1993) Nature Genetics J:266-272) provided by the 
40 NCBI. For convenience, the P-value (probability) of observing a match of a cDNA 

sequence to a sequence contained in the searched databases merely by chance as calculated 
by BLAST are reported herein as "pLog" values, which represent the negative of the 
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logarithm of the reported P-value. Accordingly, the greater the pLog value, the greater the 
likelihood that the cDNA sequence and the BLAST "hit" represent homologous proteins. 

EXAMPLE 3 

rharacterization of cDNA Clones Encoding A nthranilate Synthase 
5 The BLASTX search using the EST sequences from several com, rice and wheat 

clones revealed similarity of the proteins encoded by the cDNAs to anthranilate synthase 
from Arabidopsis thaliana and Ruta graveolens. The BLASTX search using the nucleotide 
sequence from clone wreln.pk0036.gI0 also revealed similarity of the protein encoded by 
the cDNA to para-aminobenzoate synthase from Streptomyces pristinaespiralis (GenBank 
10 Accession No.U60417; pLog = 27.39). In the process of comparing the com ESTs it was 
found that clones cen3n.pk0210.c3, ctaln.pk0025.f8 and cen3n.pk0047.g5 had overlapping 
regions of homology. A comparison of the com ESTs from clones cen3n.pk0212.h6 and 
csiln.pk0017.d4 also had overiapping regions of homology. Using this homology it was 
possible to align the ESTs and assemble two contigs (a contig is an assemblage of 
15 overlapping nucleic acid sequences to form one contiguous nucleotide sequence). The 

individual sequences were assembled into unique contiguous nucleotide sequences encoding 
anthranilate synthase from com. The database accession numbers and BLAST results for 
each of these ESTs and contigs are shown in Table 3: 

20 TABLE 3 

BLAST Results for Clones Encoding Polypeptides Homologous 
to Anthranilate Synthase 



Database 

Clone Organism Accession No Blast Score pLog 



Contig of: 


A. 


thaliana 


GenBank L22585 


70.69 


cen3n.pk0210.c3b 

ctaln.pk0025.f8 

cen3n.pk0047.g5b 










ceb3.pk0011.cl2 


R. 


graveolens 


GenBank L34344 


48.15 


ctaln.pk0051.g3 


R. 


graveolens 


GenBank L34344 


32.22 


cbn2.pk0049.d2 


R. 


graveolens 


GenBank L34344 


35.00 


Contig of: 


A. 


thaliana 


GenBank L22585 


34.00 


cen3n.pk0212.h6 
csiln.pk0017.d4 










ccoln.pk0038.bl 


A. 


thaliana 


GenBank L22585 


17.69 


rls24.pk0002.e2 


A. 


thaliana 


GenBank L22585 


20.69 


wleln.pk0075.b4 


R. 


graveolens 


GenBank L34343 


38.00 



Characterization ofcDNA Clones Encoding Anthranilate Synthase Alpha Subunit 
25 The sequence of the entire cDNA insert in clone ctal n.pkOOS 1 .g3 was determined. A 

contig was assembled with this sequence and sequences from portions of the cDNA inserts 
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10 



15 



in clones cdelc.pk004.g2, p0125.czabf83r, p0005.cbmew77r, p0096.cnamG2r, 
p0032.crcas26r, p0031 .ccmar07r, p0015.cdpfkl2r and p0128.cpibl07r. The sequence of the 
entire cDNA insert in clone ctaln.pkO051.g3 was determined. A contig was assembled with 
this sequence and sequences from portions of the cDNA in clones p0115.clsmj70r, 
cbn2.pk0049.d2, cpllc.pk008.m4 and p0102.ceraq01r. The BLASTX search using these 
contig sequences and the EST sequences from clones rcaln.pk004.p22, sdp3c.pk019.c6, 
sfll.pkl29.dl0, wleln.pk0075.b4, the 5' sequence from clone wlm96.pk026.jl and the 3' 
sequence from clone wlm96.pk026.jl revealed similarity of the protein encoded by the 
cDNA to anthranilate synthase alpha subunit from Ruta graveolens (NCBI General Identifier 
No. 960289). The BLAST results for each of these sequences are shown in Table 4: 

TABLE 4 

BLAST Results for Clones Encoding Polypeptides Homologous 
to Anthranilate Synthase Alpha Subunit 



Clone 



BLAST pLog Score. 
NCBI General Identifier No. 960289 



Contig of: 

cdelc.pk004.g2 

p0125.czabf83r 

p0005.cbmew77r 

p0096.cnamt32r 

p0032.crcas26r 

p0031.ccmar07r 

p0015.cdpfkl2r 

p0128.cpibl07r 

ctaln.pk0051.g3 

Contig of: 

p0115.clsmj70r 
cbn2.pk0049.d2 
cpllc.pk008.m4 
p0102.ceraq01r 
ceb3.pk00ll.cl2 

rcaln.pk004.p22 

sdp3c.pk019.c6 

sfll.pkl29.dl0 

wleln.pk0075.b4 (5'end) 

wleln.pk0075.b4 (3'end) 

wlm96.pk026.jl 



>254 



>254 



20.15 
48.52 
34.70 
57.00 
35.70 
5.40 



20 



The sequence of the contig assembled from the entire cDNA insert from clone 
ctaln.pkOOSl .g3 and a portion of the cDNA insert in clones cdelc.pk004.g2, 
p0125.czabf83r, p0005.cbmew77r, p0096.cnamt32r, p0032.crcas26r, pO03 1 .ccmar07r, 
p0015.cdpfkl2r and p0128.cpibl07r is shown in SEQ ID NO: 1; the deduced amino acid 
sequence of this cDNA is shown in SEQ ID N0:2. The amino acid sequence set forth in 
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SEQ ID N0:2 was evaluated by BLASTP, yielding a pLog value >254 versus the Ruia 
graveolens sequence. The sequence of the contig assembled from the entire cDNA insert in 
clone ctaln.pk0051.g3 and a portion of the cDNA in clones pOl 15xlsmj70r, 
cbn2.pk0049.d2, cpllc.pk008.m4 and p0102.ceraq01r is shown in SEQ ID N0:3; the 

5 deduced amino acid sequence of this cDNA is shown in SEQ ID N0:4. The sequence of a 
portion of the cDNA insert from clone rcaln.pk004.p22 is shown in SEQ ID N0:5; the 
deduced amino acid sequence of this cDNA is shovm in SEQ ID N0:6. The sequence of a 
portion of the cDNA insert from clone sdp3c.pk019x6 is shovm in SEQ ID N0:7; the 
deduced amino acid sequence of this cDNA is shown in SEQ ID N0:8. The sequence of a 

10 portion of the cDNA insert from clone sfll.pkl29.dl0 is shown in SEQ ID N0:9; the 

deduced amino acid sequence of this cDNA is shovm in SEQ ID NO: 10. The sequence of 
the 5* terminal portion of the cDNA insert from clone wleln.pk0075.b4 is shown in SEQ ID 
NO: 1 1 ; the deduced amino acid sequence of this cDNA is shovm in SEQ ID NO: 1 2. The 
sequence of the 3' terminal portion of the cDNA insert from clone wleln.pk0075.b4 is 

15 shown in SEQ ID NO: 13; the deduced amino acid sequence of this cDNA is shown in SEQ 
ID NO: 14. The sequence of a portion of the cDNA insert from clone wim96.pk026.j 1 is 
shown in SEQ ID NO: 15; the deduced amino acid sequence of this cDNA is shown in SEQ 
ID NO: 16. 

Figure 2 presents an alignment of the amino acid sequence set forth in SEQ ID N0:2 
20 and the Ruta graveolens anthranilate synthase alpha subunit sequence (SEQ ID NO:39). The 
amino acid sequence set forth in SEQ ID N0:2 is 64.7% similar to the Ruta graveolens 
sequence. Sequence alignments and percent similarity calculations were performed using 
the Megalign program of the LASARGENE bioinformatics computing suite (DNASTAR 
Inc., Madison, WI). Pairwise alignment of the sequences was performed using the Clustal 
25 method of alignment (Higgins, D.G. and Sharp, P.M. (1989) CABIOS. 5:151-153) with the 
default parameters (KTUPLE 1, GAP PENALTY=3, WIND0W=5 and DIAGONALS 
SAVED=5). 

Sequence alignments and BLAST scores and probabilities indicate that the instant 
nucleic acid fragments encode an entire com anthranilate synthase alpha subunit and 
30 portionsofcom, rice, soybean and wheat anthranilate synthase alpha subunits. These - 

sequences represent the first com, rice, soybean and wheat sequences encoding anthranilate 
synthase alpha subunit. 

Characterization ofcDNA Clones Encoding Anthranilate Synthase Beta Subunit 

The sequence of a larger portion of the cDNA insert in clone cen3n,pk0210.c3 was 
35 determined, this sequence includes the sequence from clone ctaln.pk0025.f8. A contig was 
assembled with the sequence from clone cen3n.pk0210.c3 and sequence from a portion of 
the cDNA in clones cepe7.pk0012.g2 and cen3n.pk0047.g5. The BLASTX search using 
these contig sequences, the sequences from the contig assembled of clones p0126.cnlcu75r, 
cen3n.pk0212.h6, ccoln.pk0038.bl, csiln.pk0017.d4, p003 1 .ccmaiOSr and ctnlc.pkOOl.llO, 
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the contig assembled from clones rds2c.pk004.i9, rls24.pk0002.e2 and rl0n.pkll3.b4 and the 
EST sequences from clone wlm96.pk045.cl 1 revealed similarity of the proteins encoded by 
the cDNAs to anthranilate synthase beta subunit from Arabidopsis thaliana (NCBI General 
Identifier No. 541 849). The BLAST results for each of these ESTs are shown in Table 5: 



TABLE 5 

BLAST Results for Clones Encoding Polypeptides Homologous 
to Anthranilate Synthase Beta Subunit 





BLAST pLog Score 


\^ione 


NCBI General Identifier No. 541849 


Contig of: 


OA CO 






cepe7.pk0012.g2 




cen3n,pk0047.g5 




Contig of: 


110.0 


p0126.cnlcu75r 




cen3n,pk0212.h6 




ccoln.pk0038.bl 




csiln.pk0017.d4 




p0031xcmai08r 




ctnlc.pkOOl.llO 




Contig of: 


20.15 


rds2c.pk004.i9 




rls24.pk0002.e2 




rl0n.pkll3.b4 




wlm96.pk045.cll 


15.00 



10 The sequence of the contig assembled from a portion of the cDNA insert from clones 

cen3n.pk0210.c3, cepe7.pk0012.g2 and cen3n.pk0047.g5 is shown in SEQ ID N0:17; the 
deduced amino acid sequence of this cDNA is shown in SEQ ID NO: 18. The sequence of 
the contig assembled from a portion of the cDNA insert from clones p0126.cnlcu75r, 
cen3n.pk0212.h6, ccoln.pk0038.bl, csiln.pk0017.d4, p003 1 .ccmaiOSr and ctnlc.pkOOl.llO 

15 is shown in SEQ ID NO: 1 7; the deduced amino acid sequence of this cDNA is shown in 
SEQ ID NO: 1 8. The amino acid sequence set forth in SEQ ID NO: 1 8 was evaluated by 
BLASTP, yielding a pLog value of 92.22 versus the Arabidopsis thaliana sequence. The 
sequence of the contig assembled from a portion of the cDNA insert from clones 
rds2c.pk004.i9, rls24.pk0002.e2 and rlOn.pkl 13.b4 is shown in SEQ ID NO: 19; the deduced 

20 amino acid sequence of this cDNA is shown in SEQ ID NO:20. The sequence of a portion 
of the cDNA insert from clone wlm96.pk045.cl 1 is shovm in SEQ ID N0:2l; the deduced 
amino acid sequence of this cDNA is shown in SEQ ID NO:22. 

Figure 3 presents an alignment of the amino acid sequences set forth in SEQ ID 
NOs:20 and 22 and the Arabidopsis thaliana anthranilate synthase beta subunit sequence 

25 (SEQ ID NO:40). The amino acid sequences set forth in SEQ ID NO:20 are 58.0% similar 
to the Arabidopsis thaliana sequence while the amino acid sequences set forth in SEQ ID 
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NO:22 are 57.1% similar to the Arabidopsis thaliana sequence. Sequence alignments and 
percent similarity calculations were performed using the Megalign program of the 
LASARGENE bioinformatics computing suite (DNASTAR Inc., Madison, WI). Multiple 
alignment of the sequences was performed using the Clustal method of alignment (Higgins, 

5 D.a and Sharp, P.M. (1989) CABIOS, 5:151-153) with the defauh parameters (GAP 
PENALTY-10, GAP LENGTH PENALTY-10). 

Sequence alignments and BLAST scores and probabilities indicate that the instant 
nucleic acid fragments encode an entire com anthranilate synthase beta subunit, an almost 
entire rice anthranilate synthase beta subunit and portions of com and wheat anthranilate 

10 synthase beta subunit. These sequences represent the first com, rice and wheat sequences 
encoding anthranilate synthase beta subunit. 

EXAMPLE 4 

Characterization of cDNA Clones Encoding Tryptophan Synthase 
The BLASTX search using the EST sequences from several com, rice, soybean and 

15 wheat clones revealed similarity of the proteins encoded by the cDNAs to tryptophan 
synthase from Zea mays and Arabidopsis thaliana. In the process of comparing the rice 
ESTs it was found that clones wlln.pkO109.d5 and wUn.pkOl 10x1 had overlapping regions 
of homology. A comparison of the com ESTs from clones chp2.pk0020.e2, 
cen3n.pk0147.d4, lkr.pk0013.gl, csl/pk0089.a8, ctaln.pk0057.d6, cen3n.pk0014.c5, 

20 cen3n.pk0193.c5 and cen3n.pkG135.dl also revealed overlapping regions of homology. A 
comparison of the com ESTs from clones m.l5.6,g07.sk20, m.l5.4.g07.sk20 and 
ccoln.pk0030.g7 revealed overlapping regions of homology. Using this homology it was 
possible to align the ESTs and assemble four contigs (a contig is an assemblage of 
overlapping nucleic acid sequences to form one contiguous nucleotide sequence). The 

25 individual sequences were assembled into unique contiguous nucleotide sequences encoding 
tryptophan synthase from com, rice, soybean and wheat. The database accession numbers 
and BLAST results for each of these ESTs and contigs are shown in Table 6: 



TABLE 6 

30 BLASTX Results for Clones Encoding Polypeptides Homologous 
to Trypotophan Synthase 



Clone 



Database 
Organism Accession No 



Blast Score pLog 



rrl.pk0038.h6 

Contig of: 

wlln.pk0109.d5 
wlln.pk0110.cl 

wlln.pk0067.f2 

crln,pk0033,f8 

crln.pk0052.b8 



A. thaliana 
Z. mays 

A. thaliana 
Z mays 
Z, mays 



GenBankU 18993 
GenBank X76713 

GenBankU 18993 
GenBank X76713 
GenBank X76713 



26.00 
52.39 

19.69 
47.69 
25.70 
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Database 




Clone 


Organism 


Accession No 


Blast Score pLog 


Contig of: 


Z mays 


UentJanK a / o / 1 j 


R'^ AO 


chp2.pk0020,e2 








cen3n.pk0147.d4 








lkr.pk0013.gl 
















rtflln nk0057 d6 








cenln nk0014 c5 








cen3n.pk0193x5 








cen3n.pk0135.dl 








Contig of: 


Z mays 


GenBank X76713 


25.52 


m.l5.6.g07.sk20 








ml5.4.g07.sk20 








ccoln.pk0030.g7 









Characterization of cDNA Clones Encoding Tryptophan Synthase Alpha Submit 

The sequence of the entire cDNA insert in clone lkr.pk0013,gl and a larger portion of 
the sequence from clone chp2.pk0020.e2 have been determined. A contig has been 
5 assembled with these two sequences and they have been found to include the sequences from 
clones ccoln.pk0030.g7, m.l5.6.g07.sk20. cen3n.pk0014,c5, cen3n.pk0135.dl, 
cen3n.pk0147.d4, cen3n.pk0193.c5, ctaln.pk0057.d6, csl.pk0089.a8 and m.l5.6.g07.sk20. 
The sequence from the entire cDNA insert in clone crln.pk0052.b8 has been determined, it 
includes the sequences from clone crln.pk0033.f8. The BLASTX search using these 

10 sequences, the sequence from the contig assembled of clones reslc.pk004.c4 and 

rrl.pk0071.f2, the EST sequences from clone rrl.pk0038.h6 and the sequence of the contig 
assembled from clones src3c.pk004.cl9 and sls2c.pk025.dl 1 revealed similarity of the 
proteins encoded by the cDNAs to tryptophan synthase alpha subunit from Arabidopsis 
thaliana (NCBI General Identifier No. 2129755). The BLASTX search using the sequence 

15 of the contig assembled from clones wlln.pk01G9.d5 and wlln.pkOl lO.cl revealed similarity 
of the proteins encoded by the cDNAs to tryptophan synthase alpha subunit from Zea mays 
(NCBI General Identifier No. 11 74783). The BLAST results for each of these sequences are 
shown in Table 7: 
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TABLE 7 

BLAST Results for Clones Encoding Polypeptides Homologous 
to Tryptophan Synthase Alpha Subunit 



Clone 


NCBI General Identifier No. 


BLAST pLog Score 


Contig of: 


2129755 


124.0 


chp2.pk0020.e2 






lkr.pk0013.gl 




120.0 


crln.pk0052.b8 


2129755 


L^OUUg Ui. 


2129755 


78.70 


rf>c 1 n nlrOnd p4 






rrl.pk007Lf2 






rrl nWnniR h6 


2129755 


125.0 


Contig of: 




49 00 


src3c.pk004.cl9 






sls2c.pk025.dl 1 






Contig of: 


1174783 


33.70 


wlm0.pk0028.f4 






wlm96.pk041.h3 






Contig of: 


1174783 


85.70 


wlln.pk0109.d5 






wlln.pkOllO.cl 







5 The sequence of the contig assembled from the entire cDNA insert in clone 

lkr.pk0013.gl and a large portion of the cDNA insert in clone chp2.pk0020.e2 is shown in 
SEQ ID NO:25; the deduced amino acid sequence of this cDNA is shown in SEQ ID NO:26. 
The amino acid sequence set forth in SEQ ID NO:26 was evaluated by BLASTP, yielding a 
pLog value of 105.0 versus the Arabidopsis thaliana sequence. The sequence of the entire 

10 cDNA insert in clone crln.pk0052.b8 was determined and is shown in SEQ ID NO:27; the 
deduced amino acid sequence of this cDNA is shown in SEQ ID NO:28. The amino acid 
sequence set forth in SEQ ID NO:28 was evaluated by BLASTP, yielding a pLog value of 
98.22 versus the Arabidopsis thaliana sequence. The sequence of the contig assembled from 
a portion of the cDNA insert in clones reslc.pk004.c4 and rrl.pk0071.f2 is shown in SEQ 

15 ID NO:29; the deduced amino acid sequence of this cDNA is shown in SEQ ID NO:30. The 
sequence of the entire cDNA insert in clone rrl.pk0038.h6 was determined and is shown in 
SEQ ID N0:3 1; the deduced amino acid sequence of this cDNA is shovm in SEQ ID NO;32. 
The amino acid sequence set forth in SEQ ID NO:32 was evaluated by BLASTP, yielding a 
pLog value of 106.0 versus the Arabidopsis thaliana sequence. The sequence of the contig 

20 assembled from a portion of the cDNA insert from clones src3c.pk004.cl9 and 

sls2c.pk025.dl 1 is shown in SEQ ID NO:33; the deduced amino acid sequence of this 
cDNA is shown in SEQ ID NO:34. The sequence of the contig assembled from a portion of 
the cDNA insert from clones wlm0.pk0028.f4 and wlm96.pk041.h3 is shown in SEQ ID 
NO:35; the deduced amino acid sequence of this cDNA is shown in SEQ ID NO:36. The 
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sequence of the contig assembled from a portion of the cDNA insert from clones 
wlln.pk0109.d5 and wlln.pkOllO.cl is shown in SEQ ID NO:37; the deduced amino acid 
sequence of this cDNA is shown in SEQ ID NO:38. 

Figure 3 presents an alignment of the amino acid sequences set forth in SEQ ID 
5 NOs:26, 28 and 32 and the Arabidopsis thaliana sequence (SEQ ID N0:41). The data in 
Table 8 represents a calculation of the percent identity of.the amino acid sequences set forth 
in SEQ ID NOs:26, 28 and 32 and the Arabidopsis thaliana tryptophan synthase alpha 
subunit sequence. 

10 TABLE 8 

Percent Identity of Amino Acid Sequences Deduced From the Nucleotide Sequences of 
cDNA Clones Encoding Polypeptides Homologous 



Clone 


SEOIDNO. 


Percent Identity to 
2129755 


Contig of: 


26 


59.0 


chp2.pk0020.e2 






lkr.pk0013.gl 






crln.pk0052.b8 


28 


56.7 


n:l.pk0038.h6 


32 


59.9 



15 Sequence alignments and percent similarity calculations were performed using the Megalign 
program of the LASARGENE bioinformatics computing suite (DNASTAR Inc., Madison, 
WI). Multiple alignment of the sequences was performed using the Clustal method of 
alignment (Higgins, D.G. and Sharp, P.M. (1989) CABIOS, 5:151-153) with the defauh 
parameters (GAP PENALTY=10, GAP LENGTH PENALTY- 10). 

20 Sequence alignments and BLAST scores and probabilities indicate that the instant 

nucleic acid fragments encode entire or nearly entire com and rice tryptophan synthase alpha 
subunit and portions of rice, soybean and wheat tryptophan synthase alpha subunit. These 
sequences represent the first rice, soybean and wheat and variant com sequences encoding 
tryptophan synthase alpha subunit. 

25 EXAMPLE 5 

Expression of Chimeric Genes in Monocot Cells 
A chimeric gene comprising a cDNA encoding a tryptophan biosynthetic enzyme in 
sense orientation with respect to the maize 27 kD zein promoter that is located 5* to the 
cDNA fragment, and the 10 kD zein 3' end that is located 3' to the cDNA fragment, can be 

30 constmcted. The cDNA fragment of this gene may be generated by polymerase chain 

reaction (PCR) of the cDNA clone using appropriate oligonucleotide primers. Cloning sites 
(Ncol or Smal) can be incorporated into the oligonucleotides to provide proper orientation of 
the DNA fragment when inserted into the digested vector pML103 as described below. 
Amplification is then performed in a standard PCR. The amplified DNA is then digested 
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with restriction enzymes Ncol and Smal and fractionated on an agarose gel. The appropriate 
band can be isolated from the gel and combined with a 4.9 kb Ncol-Smal fragment of the 
plasmid pML103. Plasmid pMLlOS has been deposited under the terms of the Budapest 
Treaty at ATCC (American Type Culture Collection, 10801 University Blvd., Manassas, VA 
5 201 10-2209), and bears accession number ATCC 97366. The DNA segment from pML103 
contains a 1.05 kb Sall-Ncol promoter fragment of the maize 27 kD zein gene and a 0,96 kb 
Smal-Sall fragment from the 3* end of the maize 10 kD zein gene in the vector pGem9Zf(+) 
(Promega). Vector and insert DNA can be ligated at 15°C overnight, essentially as 
described (Maniatis). The ligated DNA may then be used to transform £. coli XL 1 -Blue 

10 (Epicurian Coli XL-1 Blue^"; Stratagene). Bacterial transformants can be screened by 

restriction enzyme digestion of plasmid DNA and limited nucleotide sequence analysis using 
the dideoxy chain termination method (Sequenase^" DNA Sequencing Kit; U. S. 
Biochemical). The resulting plasmid construct would comprise a chimeric gene encoding, in 
the 5' to 3' direction, the maize 27 kD zein promoter, a cDNA fragment encoding a 

15 tryptophan biosynthetic enzyme, and the 10 kD zein 3' region. 

The chimeric gene described above can then be introduced into com cells by the 
following procedure. Immature com embryos can be dissected from developing caryopses 
derived from crosses of the inbred com lines H99 and LH132. The embryos are isolated 10 
to 1 1 days after pollination when they are 1 .0 to 1 .5 mm long. The embryos are then placed 

20 with the axis-side facing down and in contact with agarose-solidified N6 medium (Chu et 
al., (1975) ScL Sin. Peking 18:659-668). The embryos are kept in the dark at 27°C. Friable 
embryogenic callus consisting of undifferentiated masses of cells with somatic 
proembryoids and embryoids borne on suspensor structures proliferates from the scutellum 
of these immature embryos. The embryogenic callus isolated from the primary explant can 

25 be cultured on N6 medium and sub-cultured on this medium every 2 to 3 weeks. 

The plasmid, p35S/Ac (obtained from Dr. Peter Eckes, Hoechst Ag, Frankfurt, 
Germany) may be used in transformation experiments in order to provide for a selectable 
marker. This plasmid contains the Pat gene (see European Patent Publication 0 242 236) 
which encodes phosphinothricin acetyl transferase (PAT). The enzyme PAT confers 

30 resistance to herbicidal glutamine synthetase inhibitors such as phosphinothricin. The pat 
gene in p35S/Ac is under the control of the 35S promoter from Cauliflower Mosaic Virus 
(Odell et al. (1985) //a/wre 313:810-812) and the 3' region of the nopaline synthase gene 
from the T-DNA of the Ti plasmid of Agrobacterium tumefaciens. 

The particle bombardment method (Klein et al., (1987) Nature 327:70-73) may be 

35 used to transfer genes to the callus culture cells. According to this method, gold particles 
(1 |im in diameter) are coated with DNA using the following technique. Ten ^g of plasmid 
DNAs are added to 50 fxL of a suspension of gold particles (60 mg per mL). Calcium 
chloride (50 of a 2.5 M solution) and spermidine free base (20 of a 1.0 M solution) 
are added to the particles. The suspension is vortexed during the addition of these solutions. 
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After 10 minutes, the tubes are briefly centrifliged (5 sec at 15,000 rpm) and the supernatant 
removed. The particles are resuspended in 200 [iL of absolute ethanol, centrifiiged again 
and the supernatant removed. The ethanol rinse is performed again and the particles 
resuspended in a final volume of 30 |iL of ethanol. An aliquot (5 |aL) of the DNA-coated 
5 gold particles can be placed in the center of a Kapton^^ flying disc (Bio-Rad Labs). The 
particles are then accelerated into the com tissue with a Biolistic^" PDS-lOOO/He (Bio-Rad 
Instruments, Hercules CA), using a helium pressure of 1000 psi, a gap distance of 0.5 cm 
and a flying distance of 1 .0 cm. 

For bombardment, the embryogenic tissue is placed on filter paper over agarose- 

10 solidified N6 medium. The tissue is arranged as a thin lawn and covered a circular area of 
about 5 cm in diameter. The petri dish containing the tissue can be placed in the chamber of 
the PDS-lOOO/He approximately 8 cm from the stopping screen. The air in the chamber is 
then evacuated to a vacuum of 28 inches of Hg. The macrocarrier is accelerated with a 
helium shock wave using a rupture membrane that bursts when the He pressure in the shock 

15 tube reaches 1000 psi. 

Seven days after bombardment the tissue can be transferred to N6 medium that 
contains gluphosinate (2 mg per liter) and lacks casein or proline. The tissue continues to 
grow slowly on this medium. After an additional 2 weeks the tissue can be transferred to 
fresh N6 medium containing gluphosinate. After 6 weeks, areas of about I cm in diameter 

20 of actively growing callus can be identified on some of the plates containing the glufosinate- 
supplemented medium. These calli may continue to grow when sub-cultured on the 
selective medium. 

Plants can be regenerated from the transgenic callus by first transferring clusters of 
tissue to N6 medium supplemented with 0.2 mg per liter of 2,4-D. After two weeks the 
25 tissue can be transferred to regeneration medium (Fromm et al., (1990) Bio/Technology 
5:833-839). 

EXAMPLE 6 
Expression of Chimeric Genes in Dicot Cells 
A seed-specific expression cassette composed of the promoter and transcription 
30 terminator from the gene encoding the p subunit of the seed storage protein phaseolin from 
the bean Phaseolus vulgaris (Doyle et al. (1986) 1 Biol Chem. 261 :9228-9238) can be used 
for expression of the instant tryptophan biosynthetic enzyme in transformed soybean. The 
phaseolin cassette includes about 500 nucleotides upstream (5') from the translation initiation 
codon and about 1650 nucleotides downstream (3') from the translation stop codon of 
35 phaseolin. Between the 5' and 3* regions are the unique restriction endonuclease sites Nco I 
(which includes the ATG translation initiation codon), Sma I, Kpn I and Xba L The entire 
cassette is flanked by Hind III sites. 

The cDNA fragment of this gene may be generated by polymerase chain reaction 
(PCR) of the cDNA clone using appropriate oligonucleotide primers. Cloning sites can be 
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incorporated into the oligonucleotides to provide proper orientation of the DNA fragment 
when inserted into the expression vector. Amplification is then performed as described 
above, and the isolated fragment is inserted into a pUC18 vector carrying the seed 
expression cassette. 

5 Soybean embroys may then be transformed v^ith the expression vector comprising 

sequences encoding a tryptophan biosynthetic enzyme. To induce somatic embryos, 
cotyledons, 3-5 mm in length dissected from surface sterilized, immature seeds of the 
soybean cultivar A2872, can be cultured in the light or dark at 26*'C on an appropriate agar 
medium for 6-10 weeks. Somatic embryos which produce secondary embryos are then 

10 excised and placed into a suitable liquid medium. After repeated selection for clusters of 
somatic embryos which multiplied as early, globular staged embryos, the suspensions are 
maintained as described below. 

Soybean embryogenic suspension cultures can maintained in 35 mL liquid media on a 
rotary shaker, 150 rpm, at 26''C with florescent lights on a 16:8 hour day/night schedule. 

15 Cultures are subcultured every two weeks by inoculating approximately 35 mg of tissue into 
35 mL of liquid medium. 

Soybean embryogenic suspension cultures may then be transformed by the method of 
particle gun bombardment (Kline et al. (1987) Nature (London) 527:70, U.S. Patent 
No. 4,945,050). A DuPont Biolistic™ PDSIOOO/HE instrument (helium retrofit) can be used 

20 for these transformations. 

A selectable marker gene which can be used to facilitate soybean transformation is a 
chimeric gene composed of the 35 S promoter from Cauliflower Mosaic Virus (Odell et 
aL(1985) Nature 5/3:810-812), the hygromycin phosphotransferase gene from plasmid 
pJR225 (from E, coli; Gritz et al.(1983) Gene 25:179-188) and the 3' region of the nopaline 

25 synthase gene from the T-DNA of the Ti plasmid of Agrobacterium tumefaciens. The seed 
expression cassette comprising the phaseolin 5' region, the fragment encoding the tryptophan 
biosynthetic enzyme and the phaseolin 3' region can be isolated as a restriction fragment. 
This fragment can then be inserted into a unique restriction site of the vector carrying the 
marker gene. 

30 To 50 \iL of a 60 mg/mL 1 ^im gold particle suspension is added (in order): 5 jtL 

DNA (1 Hg/|uL), 20 \i\ spermidine (0.1 M), and 50 ^L CaCl2 (2.5 M). The particle 
preparation is then agitated for three minutes, spun in a microfuge for 10 seconds and the 
supernatant removed. The DNA -coated particles are then washed once in 400 |iL 70% 
ethanol and resuspended in 40 |iL of anhydrous ethanol. The DNA/particle suspension can 

35 be sonicated three times for one second each. Five |iL of the DNA-coated gold particles are 
then loaded on each macro carrier disk. 

Approximately 300-400 mg of a two-week-old suspension culture is placed in an 
empty 60x15 mm petri dish and the residual liquid removed from the tissue with a pipette. 
For each transformation experiment, approximately 5-10 plates of tissue are normally 
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bombarded. Membrane rupture pressure is set at 1 100 psi and the chamber is evacuated to a 
vacuum of 28 inches mercury. The tissue is placed approximately 3.5 inches away from the 
retaining screen and bombarded three times. Following bombardment, the tissue can be 
divided in half and placed back into liquid and cultured as described above. 
5 Five to seven days post bombardment, the liquid media may be exchanged with fresh 

media, and eleven to twelve days post bombardment with fresh media containing 50 mg/mL 
hygromycin. This selective media can be refreshed weekly. Seven to eight weeks post 
bombardment, green, transformed tissue may be observed growing from untransformed, 
necrotic embryogenic clusters. Isolated green tissue is removed and inoculated into 
10 individual flasks to generate new, clonally propagated, transformed embryogenic suspension 
cultures. Each new line may be treated as an independent transformation event. These 
suspensions can then be subcultured and maintained as clusters of immature embryos or 
regenerated into whole plants by maturation and germination of individual somatic embryos. 

EXAMPLE 7 

15 Expression of Chimeric Genes in Microbial Cells 

The cDNAs encoding the instant tryptophan biosynthetic enzymes can be inserted into 
the T7 £. coli expression vector pBT430. This vector is a derivative of pETOa (Rosenberg 
et al. (1987) Gene 5(J:125-135) which employs the bacteriophage T7 RNA polymerase/T7 
promoter system. Plasmid pBT430 was constructed by first destroying the EcoR I and 

20 Hind III sites in pET-3a at their original positions. An oligonucleotide adaptor containing 
EcoR I and Hind III sites was inserted at the BamH I site of pET-3a. This created pET-3aM 
with additional unique cloning sites for insertion of genes into the expression vector. Then, 
the Nde I site at the position of translation initiation was converted to an Nco I site using 
oligonucleotide-directed mutagenesis. The DNA sequence of pET-3aM in this region, 

25 5'-CATATGG, was converted to 5'-CCCATGG in pBT430. 

Plasmid DNA containing a cDNA may be appropriately digested to release a nucleic 
acid fragment encoding the protein. This fragment may then be purified on a 1% NuSieve 
GTG"^" low melting agarose gel (FMC). Buffer and agarose contain 10 \xg/ml ethidium 
bromide for visualization of the DNA fragment. The fragment can then be purified from the 

30 agarose gel by digestion with GELase^" (Epicentre Technologies) according to the 

manufacturer's instructions, ethanol precipitated, dried and resuspended in 20 ^iL of water. 
Appropriate oligonucleotide adapters may be ligated to the fragment using T4 DNA ligase 
(New England Biolabs, Beverly, MA). The fragment containing the ligated adapters can be 
purified from the excess adapters using low melting agarose as described above. The vector 

35 pBT430 is digested, dephosphorylated with alkaline phosphatase (NEB) and deproteinized 
with phenol/chloroform as described above. The prepared vector pBT430 and fragment can 
then be ligated at 16*'C for 15 hours followed by transformation into DH5 electrocompetent 
cells (GIBCO BRL). Transformants can be selected on agar plates containing LB media and 
100 \xg/mL ampicillin. Transformants containing the gene encoding the tryptophan 
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biosynthetic enzyme are then screened for the correct orientation with respect to the T7 
promoter by restriction enzyme analysis. 

For high level expression, a plasmid clone with the cDNA insert in the correct 
orientation relative to the T7 promoter can be transformed into £ coli strain BL21(DE3) 

5 (Studier et al. (1986) 1 Mol Biol 759:1 13-130). Cultures are grown in LB medium 

containing ampicillin (100 mg/L) at 25°C. At an optical density at 600 nm of approximately 
1, IPTG (isopropylthio-p-galactoside, the inducer) can be added to a final concentration of 
0.4 mM and incubation can be continued for 3 h at 25°. Cells are then harvested by 
centrifugation and re-suspended in 50 of 50 mM Tris-HCl at pH 8.0 containing 0.1 mM 

10 DTT and 0.2 mM phenyl methylsulfonyl fluoride. A small amount of 1 mm glass beads can 
be added and the mixture sonicated 3 times for about 5 seconds each time with a microprobe 
sonicator. The mixture is centrifuged and the protein concentration of the supernatant 
determined. One ^ig of protein from the soluble fraction of the culture can be separated by 
SDS-polyacrylamide gel electrophoresis. Gels can be observed for protein bands migrating 

15 at the expected molecular weight. 

EXAMPLE 8 

Evaluating Compounds for Their Abilitv to Inhibit the Activity 
of Tryptophan Biosynthetic Enzymes 
The tryptophan biosynthetic enzymes described herein may be produced using any 

20 number of methods known to those skilled in the art. Such methods include, but are not 

limited to, expression in bacteria as described in Example 7, or expression in eukaryotic cell 
culture, in planta, and using viral expression systems in suitably infected organisms or cell 
lines. The instant tryptophan biosynthetic enzymes may be expressed either as mature forms 
of the proteins as observed in vivo or as fusion proteins by covalent attachment to a variety 

25 of enzymes, proteins or affinity tags. Common fusion protein partners include glutathione 
S-transferase ("GST"), thioredoxin ("Trx"), maltose binding protein, and C- and/or 
N-terminal hexahistidine polypeptide ("(His)^")- The fusion proteins may be engineered 
with a protease recognition site at the fusion point so that fusion partners can be separated by 
protease digestion to yield intact mature enzyme. Examples of such proteases include 

30 thrombin, enterokinase and factor Xa. However, any protease can be used which specifically 
cleaves the peptide connecting the fusion protein and the enzyme. 

Purification of the instant tryptophan biosynthetic enzymes, if desired, may utilize any 
number of separation technologies familiar to those skilled in the art of protein purification. 
Examples of such methods include, but are not limited to, homogenization, filtration, 

35 centrifijgation, heat denaturation, ammonium sulfate precipitation, desalting, pH 

precipitation, ion exchange chromatography, hydrophobic interaction chromatography and 
affinity chromatography, wherein the affinity ligand represents a substrate, substrate analog 
or inhibitor. When the tryptophan biosynthetic enzymes are expressed as fusion proteins, the 
purification protocol may include the use of an affinity resin which is specific for the fusion 
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protein tag attached to the expressed enzyme or an affinity resin containing iigands which are 
specific for the enzyme. For example, a tryptophan biosynthetic enzyme may be expressed 
as a fusion protein coupled to the C-terminus of thioredoxin. In addition, a (His)^ peptide 
may be engineered into the N-terminus of the fused thioredoxin moiety to afford additional 

5 opportunities for affinity purification. Other suitable affinity resins could be synthesized by 
linking the appropriate Iigands to any suitable resin such as Sepharose-4B. In an alternate 
embodiment, a thioredoxin fusion protein may be eluted using dithiothreitol; however, 
elution may be accomplished using other reagents which interact to displace the thioredoxin 
from the resin. These reagents include p-mercaptoethanol or other reduced thiol. The 

10 eluted fiision protein may be subjected to ftirther purification by traditional means as stated 
above, if desired. Proteolytic cleavage of the thioredoxin fusion protein and the enzyme may 
be accomplished after the fusion protein is purified or while the protein is still bound to the 
ThioBond^" affinity resin or other resin. 

Crude, partially purified or purified enzyme, either alone or as a fusion protein, may be 

15 utilized in assays for the evaluation of compounds for their ability to inhibit enzymatic 
activation of the tryptophan biosynthetic enzymes disclosed herein. Assays may be 
conducted under well known experimental conditions which permit optimal enzymatic 
activity. For example, assays for anthranilate synthase alpha subunit are presented by 
Bohlmann J et al. (1995) J 7:491-501. Assays for anthranilate synthase beta subunit 

20 are presented by Walker M. S. and DeMoss J. A. (1983) Chem 255:3571-3575. 

Assays for tryptophan synthase alpha subunit are presented by Zhao J and Last R. L. (1995) 
JBiolChem270:60%l-7, 
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CLAIMS 

What is claimed is: 

1 . An isolated nucleic acid fragment encoding all or a substantial portion of an 
anthranilate synthase alpha subunit comprising a member selected from the group consisting 

5 of: 

(a) an isolated nucleic acid fragment encoding all or a substantial portion of 
the amino acid sequence set forth in a member selected from the group 
consisting of SEQ ID N0:2, SEQ ID N0:4, SEQ ID N0:6, SEQ ID 
N0:8, SEQ ID NO: 1 0, SEQID NO: 12, SEQ ID NO: 14 and SEQ ID 

10 N0:16; 

(b) an isolated nucleic acid fragment that is substantially similar to an 
isolated nucleic acid fragment encoding all or a substantial portion of 
the amino acid sequence set forth in a member selected from the group 
consisting of SEQ ID N0:2, SEQ ID N0:4, SEQ ID N0:6, SEQ ID 

15 N0:8, SEQ ID NO: 10, SEQID NO: 12, SEQ ID NO: 14 and SEQ ID 

N0:16;and 

(c) an isolated nucleic acid fragment that is complementary to (a) or (b). 

2. The isolated nucleic acid fragment of Claim 1 wherein the isolated nucleic acid 
fragment hybridizes imder stringent conditions to an isolated nucleic acid fragment encoding 

20 all or a substantial portion of the amino acid sequence set forth in SEQ ID N0:2. 

3. The isolated nucleic acid fragment of Claim 1 wherein the isolated nucleic acid 
fragment hybridizes under stringent conditions to an isolated nucleic acid fragment encoding 
all or a substantial portion of the amino acid sequence set forth in SEQ ID N0:4. 

4. The isolated nucleic acid fragment of Claim 1 wherein the isolated nucleic acid 
25 fragment hybridizes under stringent conditions to an isolated nucleic acid fragment encoding 

all or a substantial portion of the amino acid sequence set forth in SEQ ID N0:6. 

5. The isolated nucleic acid fragment of Claim 1 wherein the isolated nucleic acid 
fragment hybridizes under stringent conditions to an isolated nucleic acid fragment encoding 
all or a substantial portion of the amino acid sequence set forth in SEQ ID N0:8. 

30 6. The isolated nucleic acid fragment of Claim 1 wherein the isolated nucleic acid 

fragment hybridizes under stringent conditions to an isolated nucleic acid fragment encoding 
all or a substantial portion of the amino acid sequence set forth in SEQ ID NO: 10. 

7. The isolated nucleic acid fragment of Claim 1 wherein the isolated nucleic acid 
fragment hybridizes under stringent conditions to an isolated nucleic acid fragment encoding 

35 all or a substantial portion of the amino acid sequence set forth in SEQ ID NO: 12. 

8. The isolated nucleic acid fragment of Claim 1 wherein the isolated nucleic acid 
fragment hybridizes under stringent conditions to an isolated nucleic acid fragment encoding 
all or a substantial portion of the amino acid sequence set forth in SEQ ID NO: 14. 
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9. The isolated nucleic acid fragment of Claim 1 wherein the isolated nucleic acid 
fragment hybridizes under stringent conditions to an isolated nucleic acid fragment encoding 
all or a substantial portion of the amino acid sequence set forth in SEQ ID NO: 16. 

10. The isolated nucleic acid fragment of Claim 1 wherein the isolated nucleic acid 
5 fragment is at least 80% similar to an isolated nucleic acid fragment encoding all or a 

substantial portion of the amino acid sequence set forth in .SEQ ID N0;2. 

1 1 . The isolated nucleic acid fragment of Claim 1 wherein the isolated nucleic acid 
fragment is at least 80% similar to an isolated nucleic acid fragment encoding all or a 
substantial portion of the amino acid sequence set forth in SEQ ID N0:4. 

10 12. The isolated nucleic acid fragment of Claim 1 wherein the isolated nucleic acid 

fragment is at least 80% similar to an isolated nucleic acid fragment encoding all or a 
substantial portion of the amino acid sequence set forth in SEQ ID N0:6. 

13. The isolated nucleic acid fragment of Claim 1 wherein the isolated nucleic acid 
fragment is at least 80% similar to an isolated nucleic acid fragment encoding all or a 

15 substantial portion of the amino acid sequence set forth in SEQ ID NO: 8. 

14. The isolated nucleic acid fragment of Claim 1 wherein the isolated nucleic acid 
fragment is at least 80% similar to an isolated nucleic acid fragment encoding all or a 
substantial portion of the amino acid sequence set forth in SEQ ID NO: 10. 

15. The isolated nucleic acid fragment of Claim 1 wherein the isolated nucleic acid 
20 fragment is at least 80% similar to an isolated nucleic acid fragment encoding all or a 

substantial portion of the amino acid sequence set forth in SEQ ID NO: 12. 

16. The isolated nucleic acid fragment of Claim 1 wherein the isolated nucleic acid 
fragment is at least 80% similar to an isolated nucleic acid fragment encoding all or a 
substantial portion of the amino acid sequence set forth in SEQ ID NO: 14. 

25 17. The isolated nucleic acid fragment of Claim 1 wherein the isolated nucleic acid 

fragment is at least 80% similar to an isolated nucleic acid fragment encoding all or a 
substantial portion of the amino acid sequence set forth in SEQ ID NO: 16. 

18. The isolated nucleic acid fragment of Claim 1 wherein the nucleotide sequence 
of the fragment comprises all or a portion of the sequence set forth in a member selected 

30 from the group consisting of SEQ ID N0:1, SEQ ID N0:3, SEQ ID N0:5, SEQ ID N0:7, 
SEQ ID N0:9, SEQID NO: 1 1 , SEQ ID NO: 1 3 and SEQ ID NO: 1 5 . 

19. A chimeric gene comprising the nucleic acid fragment of Claim 1 operably 
linked to suitable regulatory sequences. 

20. A transformed host cell comprising the chimeric gene of Claim 19. 

35 2 1 . An anthranilate synthase alpha subunit polypeptide comprising all or a 

substantial portion of the amino acid sequence set forth in a member selected from the group 
consisting of SEQ ID N0:2, SEQ ID N0:4, SEQ ID N0:6, SEQ ID N0:8, SEQ ID NO: 10, 
SEQID NO: 12, SEQ ID NO: 14 and SEQ ID NO: 16. 
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22. An isolated nucleic acid fragment encoding all or a substantial portion of an 
anthranilate synthase beta subunit comprising a member selected from the group consisting 
of: 

(a) an isolated nucleic acid fragment encoding all or a substantial portion of 
5 the amino acid sequence set forth in a member selected from the group 

consisting of SEQ ID NO: 18, SEQ ID NO:20, SEQ ID NO:22 and SEQ 
ID NO:24; 

(b) an isolated nucleic acid fragment that is substantially similar to an 
isolated nucleic acid fragment encoding all or a substantial portion of 

10 the amino acid sequence set forth in a member selected from the group 

consisting of SEQ ID N0:18, SEQ ID NO:20, SEQ ID NO:22 and SEQ 
ID NO:24; and 

(c) an isolated nucleic acid fragment that is complementary to (a) or (b). 

23. The isolated nucleic acid fragment of Claim 22 wherein the isolated nucleic 
15 acid fragment hybridizes under stringent conditions to an isolated nucleic acid fragment 

encoding all or a substantial portion of the amino acid sequence set forth in SEQ ID NO: 18. 

24. The isolated nucleic acid fragment of Claim 22 wherein the isolated nucleic 
acid fragment hybridizes under stringent conditions to an isolated nucleic acid fragment 
encoding all or a substantial portion of the amino acid sequence set fr)rth in SEQ ID NO:20. 

20 25. The isolated nucleic acid fragment of Claim 22 wherein the isolated nucleic 

acid fragment hybridizes under stringent conditions to an isolated nucleic acid fragment 
encoding all or a substantial portion of the amino acid sequence set forth in SEQ ID NO:22. 

26. The isolated nucleic acid fragment of Claim 22 wherein the isolated nucleic 
acid fragment hybridizes under stringent conditions to an isolated nucleic acid fragment 

25 encoding all or a substantial portion of the amino acid sequence set forth in SEQ ID NO:24, 

27. The isolated nucleic acid fragment of Claim 22 wherein the isolated nucleic 
acid fragment is at least 80% similar to an isolated nucleic acid fragment encoding all or a 
substantial portion of the amino acid sequence set forth in SEQ ID NO: 18. 

28. The isolated nucleic acid fragment of Claim 22 wherein the isolated nucleic 
30 acid fragment is at least 80% similar to an isolated nucleic acid fragment encoding all or a 

substantial portion of the amino acid sequence set forth in SEQ ID NO:20. 

29. The isolated nucleic acid fragment of Claim 22 wherein the isolated nucleic \ 
acid fragment is at least 80% similar to an isolated nucleic acid fragment encoding all or a 
substantial portion of the amino acid sequence set forth in SEQ ID NO:22. 

35 30. The isolated nucleic acid fragment of Claim 22 wherein the isolated nucleic 

acid fragment is at least 80% similar to an isolated nucleic acid fragment encoding all or a 
substantial portion of the amino acid sequence set forth in SEQ ID NO:24. 

3 1 . The isolated nucleic acid fragment of Claim 22 wherein the nucleotide 
sequence of the fragment comprises all or a portion of the sequence set forth in a member 
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selected from the group consisting of SEQ ID N0:17, SEQ ID N0:19, SEQ ID N0:21 and 
SEQ ID NO:23. 

32. A chimeric gene comprising the nucleic acid fragment of Claim 22 operably 
linked to suitable regulatory sequences. 
5 33. A transformed host cell comprising the chimeric gene of Claim 32. 

34. An anthranilate synthase beta subunit polypeptide comprising all or a 
substantial portion of the amino acid sequence set forth in a member selected from the group 
consisting of SEQ ID NO: 18, SEQ IDNO:20, SEQ ID NO:22 and SEQ ID NO:24. 

35. An isolated nucleic acid fragment encoding all or a substantial portion of a 

10 tryptophan synthase alpha subunit comprising a member selected from the group consisting 
of: 

(a) an isolated nucleic acid fragment encoding all or a substantial portion of 
the amino acid sequence set forth in a member selected from the group 
consisting of SEQ ID NO:26, SEQ ID NQ:28, SEQ ID NO:30, SEQ ID 

15 NO:32, SEQ ID NO:34, SEQ ID NO:36 and SEQ ID NO:38; 

(b) an isolated nucleic acid fragment that is substantially similar to an 
isolated nucleic acid fragment encoding all or a substantial portion of 
the amino acid sequence set forth in a member selected from the group 
consisting of SEQ ID NO:26, SEQ ID NO:28, SEQ ID NO:30, SEQ ID 

20 NO:32, SEQ ID NO:34, SEQ ID NO:36 and SEQ ID NO:38; and 

(c) an isolated nucleic acid fragment that is complementary to (a) or (b). 

36. The isolated nucleic acid fragment of Claim 35 wherein the isolated nucleic 
acid fragment hybridizes under stringent conditions to an isolated nucleic acid fragment 
encoding all or a substantial portion of the amino acid sequence set forth in SEQ ID NO:26. 

25 37. The isolated nucleic acid fragment of Claim 35 wherein the isolated nucleic 

acid fragment hybridizes under stringent conditions to an isolated nucleic acid fragment 
encoding all or a substantial portion of the amino acid sequence set forth in SEQ ID NO:28. 

38. The isolated nucleic acid fragment of Claim 35 wherein the isolated nucleic 
acid fragment hybridizes under stringent conditions to an isolated nucleic acid fragment 

30 encoding all or a substantial portion of the amino acid sequence set forth in SEQ ID NO:30. 

39. The isolated nucleic acid fragment of Claim 35 wherein the isolated nucleic 
acid fragment hybridizes under stringent conditions to an isolated nucleic acid fragment 
encoding all or a substantial portion of the amino acid sequence set forth in SEQ ID NO:32. 

40. The isolated nucleic acid fragment of Claim 35 wherein the isolated nucleic 
35 acid fragment hybridizes under stringent conditions to an isolated nucleic acid fragment 

encoding all or a substantial portion of the amino acid sequence set forth in SEQ ID NO:34. 

41 . The isolated nucleic acid fragment of Claim 35 wherein the isolated nucleic 
acid fragment hybridizes under stringent conditions to an isolated nucleic acid fragment 
encoding all or a substantial portion of the amino acid sequence set forth in SEQ ID NO:36. 

39 



SUBSTITUTE SHEET (RULE 26) 



BNSDOCtD: <WO 9949058A2_L> 



wo 99/49058 



PCT/US99/06046 



42. The isolated nucleic acid fragment of Claim 35 wherein the isolated nucleic 
acid fragment hybridizes under stringent conditions to an isolated nucleic acid fragment 
encoding all or a substantial portion of the amino acid sequence set forth in SEQ ID NO:38. 

43. The isolated nucleic acid fragment of Claim 35 wherein the isolated nucleic 
5 acid fragment is at least 80% similar to an isolated nucleic acid fragment encoding all or a 

substantial portion of the amino acid sequence set forth in SEQ ID NO:26. 

44. The isolated nucleic acid fragment of Claim 35 wherein the isolated nucleic 
acid fragment is at least 80% similar to an isolated nucleic acid fragment encoding all or a 
substantial portion of the amino acid sequence set forth in SEQ ID NO:28. 

10 45. The isolated nucleic acid fragment of Claim 3 5 wherein the isolated nucleic 

acid fragment is at least 80% similar to an isolated nucleic acid fragment encoding all or a 
substantial portion of the amino acid sequence set forth in SEQ ID NO:30. 

46. The isolated nucleic acid fragment of Claim 35 wherein the isolated nucleic 
acid fragment is at least 80% similar to an isolated nucleic acid fragment encoding all or a 

15 substantial portion of the amino acid sequence set forth in SEQ ID NO:32. 

47. The isolated nucleic acid fragment of Claim 35 wherein the isolated nucleic 
acid fragment is at least 80% similar to an isolated nucleic acid fragment encoding all or a 
substantial portion of the amino acid sequence set forth in SEQ ID NO:34. 

48. The isolated nucleic acid fragment of Claim 35 wherein the isolated nucleic 
20 acid fragment is at least 80% similar to an isolated nucleic acid fragment encoding all or a 

substantial portion of the amino acid sequence set forth in SEQ ID NO:36. 

49. The isolated nucleic acid fragment of Claim 35 wherein the isolated nucleic 
acid fragment is at least 80% similar to an isolated nucleic acid fragment encoding all or a 
substantial portion of the amino acid sequence set forth in SEQ ID NO:38. 

25 50. The isolated nucleic acid fragment of Claim 35 wherein the nucleotide 

sequence of the fragment comprises all or a portion of the sequence set forth in a member 
selected from the group consisting of SEQ ID NO:25, SEQ ID NO:27, SEQ ID NO:29, SEQ 
ID N0:3 1, SEQ ID NO:33, SEQ ID NO:35 and SEQ ID NO:37. 

51. A chimeric gene comprising the nucleic acid fragment of Claim 35 operably 
30 linked to suitable regulatory sequences. 

52. A transformed host cell comprising the chimeric gene of Claim 5 1 . 

53. A tryptophan synthase alpha subunit polypeptide comprising all or a substantial 
portion of the amino acid sequence set forth in a member selected from the group consisting 
of SEQ ID NO:26, SEQ ID NO:28, SEQ ID NO:30, SEQ ID NO:32, SEQ ID NO:34, SEQ 

35 ID NO:36 and SEQ ID NO:38. 

54. A method of altering the level of expression of a tryptophan biosynthetic 
enzyme in a host cell comprising: 

(a) transforming a host cell with the chimeric gene of any of Claims 19, 32 
and 5 1 ; and 
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(b) growing the transformed host cell produced in step (a) under conditions 
that are suitable for expression of the chimeric gene 
wherein expression of the chimeric gene results in production of altered levels of a 
tryptophan biosynthetic enzyme in the transformed host cell. 
5 55 . A method of obtaining a nucleic acid fragment encoding all or a substantial 

portion of the amino acid sequence encoding a tryptophan biosynthetic enzyme comprising: 

(a) probing a cDNA or genomic library with the nucleic acid fragment of 
any of Claims 1, 22 and 35; 

(b) identifying a DNA clone that hybridizes with the nucleic acid fragment 
10 of any of Claims 1, 22 and 35; 

(c) isolating the DNA clone identified in step (b); and 

(d) sequencing the cDNA or genomic fragment that comprises the clone 
isolated in step (c) 

wherein the sequenced nucleic acid fragment encodes all or a substantial portion of the 
15 amino acid sequence encoding a tryptophan biosynthetic enzyme. 

56. A method of obtaining a nucleic acid fragment encoding a substantial portion 
of an amino acid sequence encoding a tryptophan biosynthetic enzyme comprising: 

(a) synthesizing an oligonucleotide primer corresponding to a portion of the 
sequence set forth in any ofSEQ ID NOs:l, 3, 5, 7, 9, 11, 13, 15, 17, 

20 19, 21, 23, 25, 27, 29, 31, 33, 35 and 37; and 

(b) amplifying a cDNA insert present in a cloning vector using die 
oligonucleotide primer of step (a) and a primer representing sequences 
of the cloning vector 

wherein the amplified nucleic acid fragment encodes a substantial portion of an amino acid 
25 sequence encoding a tryptophan biosynthetic enzyme. 

57. The product of the method of Claim 55. 

58. The product of the method of Claim 56. 

59. A method for evaluating at least one compound for its ability to inhibit the 
activity of a tryptophan biosynthetic enzyme, the method comprising the steps of: 

30 (a) transforming a host cell with a chimeric gene comprising a nucleic acid 

fragment encoding a tryptophan biosynthetic enzyme, operably linked 
to suitable regulatory sequences; 

(b) growing the transformed host cell under conditions that are suitable for 
expression of the chimeric gene wherein expression of the chimeric 

35 gene results in production of the tryptophan biosynthetic enzyme 

encoded by the operably linked nucleic acid fragment in the 
transformed host cell; 

(c) optionally purifying the tryptophan biosynthetic enzyme expressed by 
the transformed host cell; 
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(d) treating the tryptophan biosynthetic enzyme with a compound to be 
tested; and 

(e) comparing the activity of the tryptophan biosynthetic enzyme that has 
been treated with a test compound to the activity of an untreated 
tryptophan biosynthetic enzyme, 

thereby selecting compounds with potential for inhibitory activity. 
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SEQUENCE LISTING 

<110> E. I. DU PONT DE NEMOURS AND COMPANY 

<120> TRYPTOPHAN BIOSYNTHETIC ENZYMES 

<130> BB-1150 

<140> 
<141> 

<150> 60/079,386 

<151> 1998-03-26 

<160> 41 

<170> Microsoft Word Version 7 . OA 

<210> 1 

<211> 2266 

<212> DNA 

<213> Zea mays 

<220> 

<221> unsure 

<222> (237) 

<220> 

<221> unsure 

<222> (488) 

<220> 

<221> unsure 
<222> (516) 

<220> 

<221> unsure 
<222> (536) 



<400> 1 

ggttcttcct 

cagcccgcat 

cccaggaccc 

cagctgctta 

aatcggccta 

ccctagccgc 

tggttagggc 

gggtgaaatg 

cggcggcnaa 

cgggaagggg 

ctcgctaccg 

ccgtcgagca 

acccagtgat 

gccaagtgac 

gatggcaccc 

tcttttccta 

ctcaggacga 

tcgataatgt 

ctgttgagga 

acaattccaa 

gtacaccttt 

ctaaggaaca 

gacgaacata 

catacatggc 

ttacacgagt 

gcaagacaga 



cccccttcac 

cactcgtctc 

gaatgcccca 

tataagctca 

tagatcaaac 

cacctccgtg 

ggggacggtg 

ctctgctgcc 

ggcggcggag 

aacctggtgc 

ctgcctcgtc 

ggggccccag 

ggagattgtg 

agagcaggta 

acagcagatc 

tgatacggtt 

taggaacctt 

tgagaagaaa 

agcataccaa 

tgtccccaca 

gaacaagtcg 

tattatggct 

tgccaaccca 

gtatgtacag 

cagtaagggg 

gaaggaagat 



cgtaccgaga 

cattaaaatc 

ctcttattac 

ctaaccccgc 

gctgcactgt 

ttcgcgccct 

gtaccaacca 

gtgacgccgc 

gaggacaaga 

ccatgtggga 

cccgaggaca 

ggcaccacca 

gccaaagacc 

gtggacgacc 

gacgagctcc 

aggtatgttg 

cctgatgtgc 

gtatatgtta 

gatggcaggt 

ctctctcctg 

accatgacaa 

gggggaagct 

tttgaggttt 

gcaagaggct 

aagattatta 

caaatgcaag 



cccaaaccaa 

ggttttctcc 

ctgctaattt 

attcgctttt 

agggagcgtg 

cccgcgncgc 

ggcggacgag 

aggcgagccc 

ggcggncttc 

gtgcatcgtg 

acgtcnacgc 

acgtcggccg 

acaaggttac 

cgatgcagat 

ctgaatcctt 

agaagaagaa 

acttgggact 

tccattgggt 

cccgactaaa 

gatttgtgaa 

gtgatgagta 

tccagattgt 

atcgagcatt 

gtgtattggt 

atcgaccact 

agcagcaact 

1 



gccgcctcgc 

ttggaaatct 

tgaatttcct 

ccgactccca 

aagccagcgg 

cgtcccggcg 

cagccggagc 

agtgattagc 

gaggcggcgg 

tcggaccatc 

ccccagcttc 

ctacagcatg 

gatcatggac 

cccgaggacc 

ctccggtgga 

gctaccgttc 

ctatgatgat 

caatgtggac 

catgttgcta 

gctgcacaca 

taagaatgct 

tttaagccag 

acggattgtg 

tgcgtctagt 

tgctggaact 

gttaagtgat 



cagcacccag 

gctggcgagt 

aatcgggttg 

cctccagcca 

cgaatggaat 

gcgcgggccc 

ggaaccagcg 

aggagcgctg 

cgcggnggaa 

tacccccgtg 

ctcttcgagt 

gtgggagccc 

cacgagaaga 

atgatggagg 

tgggttgggt 

tccagtgctc 

gttctagtct 

cggcatgcat 

tctaaagtgc 

cgcaagtttg 

gttctgcagg 

aggttcgaga 

aatcctagcc 

cctgaaattc 

gttcgaaggg 

gaaaaacagt 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
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gtgccgagca cataatgctt gtggacttgg gaaggaatga tgttggcaag gtatccaaac 1620 

caggatcagt gaaggtggag aagttgatga acattgagag atactcccat gttatgcaca 1680 

tcagctcaac ggttagtgga cagttggatg atcatctcca gagttgggat gccttgagag 1740 

ctgccttgcc cgttggaaca gtcagtggtg caccaaaggt gaaggccatg gagttgattg 1800 

ataagttgga agttacgagg cgaggaccat atagtggtgg tctaggagga atatcgtttg 1860 

atggtgacat gcaaattgca ctttctctcc gcaccatcgt attctcaaca gcgccgagcc 1920 

acaacacgat gtactcatac aaagacgcag ataggcgtcg ggagtgggtc gctcatcttc 1980 

aggctggtgc aggcattgtt gccgacagta gcccagatga cgaacaacgt gaatgcgaga 2040 

ataaggctgc tgcactagct cgggccatcg atcttgcaga gtcagctttt gtagacaaag 2100 

aatagtgtgc tatggttatc gtttagttct tgttcatgtt tc-ttttaccc actttccgtt 2160 

aaaaaaagat gtcattagtg ggtggagaaa agcaataaga ctgttctcta gagaaccgaa 2220 

gaaatatgga aattgaggtt atggcttaaa aaaaaaaaaa aaaaaa 2266 

<210> 2 
<211> 603 
<212> PRT 
<213> Zea mays 



<220> 

<221> UNSURE 

<222> (15) 

<220> 

<221> UNSURE 

<222> (75) 

<220> 

<221> UNSURE 

<222> (115) 



<400> 2 

Met Glu Ser Leu Ala Ala Thr Ser Val Phe Ala Pro Ser Arg Xaa Ala 
15 10 15 

Val Pro Ala Ala Arg Ala Leu Val Arg Ala Gly Thr Val Val Pro Thr 
20 25 30 

Arg Arg Thr Ser Ser Arg Ser Gly Thr Ser Gly Val Lys Cys Ser Ala 
35 40 45 

Ala Val Thr Pro Gin Ala Ser Pro Val He Ser Arg Ser Ala Ala Ala 
50 55 60 

Ala Lys Ala Ala Glu Glu Asp Lys Arg Arg Xaa Ser Arg Arg Arg Arg 
65 70 75 80 

Gly Gly Thr Gly Arg Gly Thr Trp Cys Pro Cys Gly Ser Ala Ser Cys 
85 90 95 

Arg Thr He Tyr Pro Arg Ala Arg Tyr Arg Cys Leu Val Pro Glu Asp 
100 105 110 

Asn Val Xaa Ala Pro Ser Phe Leu Phe Glu Ser Val Glu Gin Gly Pro 
115 120 125 

Gin Gly Thr Thr Asn Val Gly Arg Tyr Ser Met Val Gly Ala His Pro 
130 135 140 

Val Met Glu He Val Ala Lys Asp His Lys Val Thr He Met Asp His 
145 150 155 160 

Glu Lys Ser Gin Val Thr Glu Gin Val Val Asp Asp Pro Met Gin He 
165 170 175 



Pro Arg Thr Met Met Glu Gly Trp His Pro Gin Gin He Asp Glu Leu 
180 185 190 



2 
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Pro Glu Ser Phe Sex Gly Gly Trp Val Gly Phe Phe Ser Tyr Asp Thr 
195 200 205 

Val Arg Tyr Val Glu Lys Lys Lys Leu Pro Phe Ser Ser Ala Pro Gin 
210 215 220 

ASP Asp Arg Asn Leu Pro Asp Val His Leu Gly Leu Tyr Asp Asp Val 
225 230 235 240 

Leu Val Phe Asp Asn Val Glu Lys Lys Val Tyr Val He His Trp Val 
245 250 255 

Asn Val Asp Arg His Ala Ser Val Glu Glu Ala Tyr Gin Asp Gly Arg 
260 265 270 

Ser Arg Leu Asn Met Leu Leu Ser Lys Val His Asn Ser Asn Val Pro 
275 280 285 

Thr Leu Ser Pro Gly Phe Val Lys Leu His Thr Arg Lys Phe Gly Thr 
290 295 300 

Pro Leu Asn Lys Ser Thr Met Thr Ser Asp Glu Tyr Lys Asn Ala Val 
305 310 315 320 

Leu Gin Ala Lys Glu His He Met Ala Gly Gly Ser Phe Gin He Val 
325 330 335 

Leu Ser Gin Arg Phe Glu Arg Arg Thr Tyr Ala Asn Pro Phe Glu Val 
340 345 350 

Tyr Arg Ala Leu Arg He Val Asn Pro Ser Pro Tyr Met Ala Tyr Val 
355 360 365 

Gin Ala Arg Gly Cys Val Leu Val Ala Ser Ser Pro Glu He Leu Thr 
370 375 380 

Arq Val Ser Lys Gly Lys He He Asn Arg Pro Leu Ala Gly Thr Val 
385 390 395 400 

Arg Arg Gly Lys Thr Glu Lys Glu Asp Gin Met Gin Glu Gin Gin Leu 
405 410 415 

Leu Ser Asp Glu Lys Gin Cys Ala Glu His He Met Leu Val Asp Leu 
420 425 430 

Gly Arg Asn Asp Val Gly Lys Val Ser Lys Pro Gly Ser Val Lys Val 
435 440 445 

Glu Lys Leu Met Asn He Glu Arg Tyr Ser His Val Met His He Ser 
450 455 460 

Ser Thr Val Ser Gly Gin Leu Asp Asp His Leu Gin Ser Trp Asp Ala 
465 470 475 480 

Leu Arg Ala Ala Leu Pro Val Gly Thr Val Ser Gly Ala Pro Lys Val 
485 490 495 

Lys Ala Met Glu Leu He Asp Lys Leu Glu Val Thr Arg Arg Gly Pro 
500 505 510 

Tyr Ser Gly Gly Leu Gly Gly He Ser Phe Asp Gly Asp Met Gin He 
515 520 525 

Ala Leu Ser Leu Arg Thr He Val Phe Ser Thr Ala Pro Ser His Asn 
530 535 540 
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Thr Met Tyr Ser Tyr Lys Asp Ala Asp Arg Arg Arg Glu Trp Val Ala 
545 550 555 560 

His Leu Gin Ala Gly Ala Gly He Val Ala Asp Ser Ser Pro Asp Asp 
565 570 575 

Glu Gin Arg Glu Cys Glu Asn Lys Ala Ala Ala Leu Ala Arg Ala He 
580 585 590 

Asp Leu Ala Glu Ser Ala Phe Val Asp Lys Glu 
595 500 



<210> 3 

<211> 1563 

<212> DNA 

<213> Zea mays 



<220> 

<221> unsure 
<222> (116) 



<400> 3 

ggccaaggct aaccatgtga cgggtcatgg 
tcgtgcctga tcccatgagg atccccagga 
ctgacagcct ccctgatgca ttttgtggag 
tgcgttatgt tgaaacaaag aagcttcctt 
ttcctgacat tcatttaggc ctctatagtg 
aaacacatgt tattcattgg gtgaggacac 
aagatggaag aaatcggctt gaagctttgt 
cactttcttc tggttctata aaactcaatg 
caacgatgtc aagcgaagaa tataaaaata 
ccggtgacat tttccaagtt gttttaagcc 
cttttgaaat ctatcgtgca ttgcgcatcg 
aggcacgagg ttgtattctc gtggcatcga 
ggacaataat caatcgtccg cttgctggaa 
acaaaacttt agaacaattg cttttgagtg 
tagtagatct tggccgaaat gatgttggga 
agaaattgat gaatatcgaa cgatattctc 
gagagctacg cgatgatctt acgtgttggg 
ccgttagtgg cgctccaaag gtgagagcaa 
tgcgtgggcc gtatagtggt ggttttggag 
cactcgctct tcgcactatt gtcttcccca 
acacagacag taagtcccgg caggagtggg 
ttgctgatag caaaccggat gacgagcacc 
ctcgtgccat tgaccttgct gaatcgacat 
aaatgtatgt ttaagttctc tgtataatta 
gccgaataaa aactcaactg taataaagtt 
aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 
aaa 



accatgagat gaagtcgagg agggagcact 60 

caatcatgga gcagtggaac ccgcanattg 120 

gatgggttgg attcttctca tatgatacag 180 

tcagtaaggc accacatgac gataggaacc 240 

acgtcattgt gtttgatcat gttgaaaaga 300 

actgctatcg ttctgttgat gaagcatatg 360 

tatcaagatt acattgcctc aatgtcccaa 420 

ttgaaaactt tggcccagta atgcaaaaat 480 

tcgttgtcca agctaaagaa cacatcttgg 540 

agcgttttga gagacggaca ttcgccgacc 600 

taaatcctag tccatatatg gcctatctac 660 

gtcctgaaat tcttacccgg gtacaaaaga 720 

ccataagaag aggcaaaaca aaagcagaag 780 

atgaaaagca gtgtgctgaa catattatgc 840 

aggtgtccaa accaggttca gtaaaggtag 900 

atgtcatgca catcagctca acagtaactg 960 

atgcgctacg agccgcattg ccagttggaa 1020 

tggagttgat tgatcagcta gaagtgagta 1080 

ggatttcctt ccgcggcgac atggacattg 1140 

ccgcatctag gtttgatacc atgtactcgt 1200 

tggctcacct ccaggccgga gctggcatag 1260 

aagagtgtat aaacaaggct gcaggtgttg 1320 

ttcttgaaga gtagtctagt ctaatgaagg 1380 

tggattgtcc tagaaaacag gctttcttag 1440 

aataaatgga caactttagc taaaaaaaaa 1500 

aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 1560 

1563 



<210> 4 

<211> 446 

<212> PRT 

<213> Zea mays 

<220> 

<221> UNSURE 

<222> (31) 



<400> 4 

Val Met Asp His Glu Met Lys Ser 
1 5 

Pro Met Arg He Pro Arg Thr He 
20 



Arg Arg Glu His Phe Val Pro Asp 
10 15 

Met Glu Gin Trp Asn Pro Xaa He 
25 30 



4 
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Ala Asp Ser Leu Pro Asp Ala Phe Cys Gly Gly Trp Val Gly Phe Phe 
35 40 45 

Ser Tyr Asp Thr Val Arg Tyr Val Glu^Thr Lys Lys Leu Pro Phe Ser 
50 55 60 

Lys Ala Pro His Asp Asp Arg Asn Leu Pro Asp He His Leu Gly Leu 
65 70 75 80 

Tyr Ser Asp Val He Val Phe Asp His Val Glu Lys Lys Thr His Val 
85 90 95 

He His Trp Val Arg Thr His Cys Tyr Arg Ser Val Asp Glu Ala Tyr 
100 105 110 

Glu Asp Gly Arg Asn Arg Leu Glu Ala Leu Leu Ser Arg Leu His Cys 
115 120 125 

Leu Asn Val Pro Thr Leu Ser Ser Gly Ser He Lys Leu Asn Val Glu 
130 135 140 

Asn Phe Gly Pro Val Met Gin Lys Ser Thr Met Ser Ser Glu Glu Tyr 
145 150 155 160 

Lys Asn He Val Val Gin Ala Lys Glu His He Leu Ala Gly Asp He 
165 170 175 

Phe Gin Val Val Leu Ser Gin Arg Phe Glu Arg Arg Thr Phe Ala Asp 
180 185 190 

Pro Phe Glu He Tyr Arg Ala Leu Arg He Val Asn Pro Ser Pro Tyr 
195 200 205 

Met Ala Tyr Leu Gin Ala Arg Gly Cys He Leu Val Ala Ser Ser Pro 
210 215 220 

Glu He Leu Thr Arg Val Gin Lys Arg Thr He He Asn Arg Pro Leu 
225 230 235 240 

Ala Gly Thr He Arg Arg Gly Lys Thr Lys Ala Glu Asp Lys Thr Leu 
245 250 255 

Glu Gin Leu Leu Leu Ser Asp Glu Lys Gin Cys Ala Glu His He Met 
260 265 270 



Leu Val Asp Leu Gly Arg Asn Asp Val Gly Lys Val Ser Lys Pro Gly 
275 280 285 

Ser Val Lys Val Glu Lys Leu Met Asn He Glu Arg Tyr Ser His Val 
290 295 300 

Met His He Ser Ser Thr Val Thr Gly Glu Leu Arg Asp Asp Leu Thr 
305 310 315 320 

Cys Trp Asp Ala Leu Arg Ala Ala Leu Pro Val Gly Thr Val Ser Gly 
325 330 335 

Ala Pro Lys Val Arg Ala Met Glu Leu He Asp Gin Leu Glu Val Ser 
340 345 350 

Met Arg Gly Pro Tyr Ser Gly Gly Phe Gly Gly He Ser Phe Arg Gly 
355 360 365 

Asp Met Asp He Ala Leu Ala Leu Arg Thr He Val Phe Pro Thr Ala 
370 375 380 
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Ser Arg Phe Asp Thr Met Tyr Ser Tyr Thr Asp Ser Lys Ser Arg Gin 
385 390 395 400 

Glu Trp Val Ala His Leu Gin Ala Gly Ala Gly He Val Ala Asp Ser 
405 410 415 

Lys Pro Asp Asp Glu His Gin Glu Cys He Asn Lys Ala Ala Gly Val 
420 425 430 

Ala Arg Ala He Asp Leu Ala Glu Ser Thr Phe Leu Glu Glu 
435 440 445 

<210> 5 

<211> 339 

<212> DNA 

<213> Oryza sativa 

<220> 

<221> unsure 
<222> (132) 

<220> 

<221> unsure 
<222> (147) 

<220> 

<221> unsure 
<222> (246) 

<220> 

<221> unsure 
<222> (251) 

<220> 

<221> unsure 
<222> (326) 

<220> 

<221> unsure 
<222> (332) 

<400> 5 

agatcagttc cacggtgagt ggagagttgg atgatcatct ccaaagttgg gatgccctgc 60 

gagccgcgtt gcctgttgga acagttagtg gagcaccaaa ggtgaaagcc atggagctga 120 

tagacgagct anaggtcaca agacgangac atacagtggc gggccttgga ggggatatca 180 

ttttgacggg agacaatgct tatcgctcct tggcactccc gcaaccaatt tgtgttctca 240 

acaagncgcc naagccaaca acaacgaatg ttctccatcc aaacgacaac cgagagggcc 300 

ccggggaagt gggtcccctc aacttnaagg cnggtggct 339 

<210> 6 

<211> 57 

<212> PRT 

<213> Oryza sativa 

<220> 

<221> UNSURE 
<222> (44) 

<220> 

<221> UNSURE 
<222> (49) 

<400> 6 

He Ser Ser Thr Val Ser Gly Glu Leu Asp Asp His Leu Gin Ser Trp 
15 10 15 

6 
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Asp Ala Leu Arg Ala Ala Leu Pro Val Gly Thr Val Ser Gly Ala Pro 
20 25 30 

Lys Val Lys Ala Met Glu Leu He Asp Glu Leu Xaa Val Thr Arg Arg 
35 40 45 

Xaa His Thr Val Ala Gly Leu Gly Gly 
50 55 

<210> 7 

<211> 513 

<212> DNA 

<213> Glycine max 

<220> 

<221> unsure 
<222> (26) 

<220> 

<221> unsure 
<222> (30) 

<220> 

<221> unsure 
<222> (79) 

<220> 

<221> unsure 
<222> (231) 

<220> 

<221> unsure 
<222> (323) 

<220> 

<221> unsure 
<222> (343) 

<220> 

<221> unsure 
<222> (426) 

<220> 

<221> unsure 
<222> (467) 

<220> 

<221> unsure 
<222> (482) 

<220> 

<221> unsure 
<222> (485) 

<220> 

<221> unsure 
<222> (491) 

<220> 

<221> unsure 
<222> (506) 

<400> 7 

atttctctta tgatacaatg cgctangtan aaaagaagaa acttccattt tctaatgccc 60 
cagtagatga cagaaaccnt cctgatgttc atctgggcct ttatgacagt gtgattgtgt 120 

7 
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ttgatcatgt 
tcaagctgag 
gcatgatata 
tggtcctaaa 
ggctaaagag 
ttttgnacaa 
antgnttaat 



tgaaaaagaa 
gaggccttcg 
attaccccaa 
ctggagatgt 
cacaaaacgg 
gagaactttt 
ncccaagtcc 



agcatatgtg 
aagatggaag 
ggctgcctac 
canacatgac 
gctgggtgat 
tgcaggaccc 
catatnattg 



attcattggg 
gaaccggctg 
aggttcgata 
aaatgaggag 
atttttcaaa 
aatttgaaat 
act 



ttcgggtgga 
gaaactctag 
aagttataca 
tanaagaggg 
attgtactaa 
ctacaanagg 



tcgatattct 180 
natctcgggt 240 
ctcgtctctt 300 
cagtattgaa 360 
agtcaaacgt 420 
cattgagggg 480 
513 



<210> 8 

<211> 123 

<212> PRT 

<213> Glycine max 



<220> 

<221> UNSURE 
<222> (8) 



<220> 

<221> UNSURE 
<222> (10) 



<220> 

<221> UNSURE 

<222> (26) 

<220> 

<221> UNSURE 

<222> (76) 



<220> 

<221> UNSURE 

<222> (107) 

<220> 

<221> UNSURE 

<222> (113) 



<400> 8 
Phe Ser Tyr Asp 
1 



Ser Asn Ala Pro 
20 



Leu Tyr Asp Ser 
35 



Val He His Trp 
50 



Phe Glu Asp Gly 
65 



Asp He He Thr 



Arg Leu Phe Gly 
100 

Xaa Lys Arg Ala 
115 



Thr Met Arg Xaa 
5 

Val Asp Asp Arg 



Val He Val Phe 
40 

Val Arg Val Asp 
55 

Arg Asn Arg Leu 
70 

Pro Arg Leu Pro 
85 

Pro Lys Leu Glu 



Val Leu Lys Ala 
120 



Val Xaa Lys Lys 
10 

Asn Xaa Pro Asp 
25 

Asp His Val Lys 



Arg Tyr Ser Ser 
60 

Glu Thr Leu Xaa 
75 

Thr Gly Ser He 
90 

Met Ser Xaa Met 
105 

Lys Glu His 



Lys Leu Pro Phe 
15 

Val His Leu Gly 
30 

Lys Lys Ala Tyr 
45 

Ala Glu Glu Ala 



Ser Arg Val His 
80 

Lys Leu Tyr Thr 
95 

Thr Asn Glu Glu 
110 



<210> 9 

<211> 507 

<212> DNA 

<213> Glycine max 

8 
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wo 99/49058 



PCT/US99/06046 



<220> 

<221> unsure 
<222> (405) 



<220> 

<221> unsure 
<222> (414) 



<220> 

<221> unsure 
<222> (479) 



<220> 

<221> unsure 
<222> (485) . . (486) 



<400> 9 

agaagaatga ccttttcctt ccttcgtcct 
ctcgcaagtg taggttttgc taaccgaacc 
tgcgctcaat ctccttctcc ttcactagtt 
aagaagggga acgtcattcc tctcttccgc 
cttgcgtacc ggtgcctggt taaggaggac 
tcggtcgagc caggccaaat ttctagcatc 
gtgtatggga aattgtggcg aaagagaacg 
ggcgcaagga gtgaggaaat tgtaaaggtt 
atggnnaagt tggacgcctc aaatcct 



ccaatggcga ctgttccgca cccattatcc 60 
tcctccatct ccagatccac tctcaaatgc 120 
gacaacgccc agaagtttct cgaagcttcc 180 
tgcatatttt ccgatcacct cactccggtg 240 
gagagagatg ctccgagttt tctctttgaa 300 
ggacggtaca gtgtggttgg agcacaagcc 360 
tgggttacta ttaanggaca acgntggaag 420 
cccccgggtg aattcctccg taaggggtna 480 

507 



<210> 10 

<211> 130 

<212> PRT 

<213> Glycine max 



<220> 

<221> UNSURE 
<222> (120) 



<400> 10 
Met Thr Phe Ser 
1 

Leu Ser Leu Ala 
20 

Arg Ser Thr Leu 
35 

Asp Asn Ala Gin 
50 

Pro Leu Phe Arg 
65 

Tyr Arg Cys Leu 



Phe Giu Ser Val 
100 

Val Val Gly Ala 
115 



Phe Leu Arg Pro 
5 

Ser Val Gly Phe 



Lys Cys Cys Ala 
40 

Lys Phe Leu Glu 

55 ^ 

Cys lie Phe Ser 
70 

Val Lys Glu Asp 
85 

Glu Pro Gly Gin 



Gin Ala Val Xaa 
120 



Pro Met Ala Thr 
10 

Ala Asn Arg Thr 
25 

Gin Ser Pro Ser 



Ala Ser Lys Lys 
60 

Asp His Leu Thr 
75 

Glu Arg Asp Ala 
90 

He Ser Ser He 
105 

Trp Glu He Val 



Val Pro His Pro 
15 

Ser Ser He Ser 
30 

Pro Ser Leu Val 
45 

Gly Asn Val He 



Pro Val Leu Ala 
80 



Pro Ser Phe Leu 
95 



Gly Arg Tyr Ser 
110 

Ala Lys Glu Asn 
125 



Val Gly 
130 



9 
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wo 99/49058 



PCT/US99/06046 



<210> 11 

<211> 452 

<212> DNA 

<213> Triticum aestivum 



<400> 11 

gcacgagcca 

cagtccgcta 

accttcctga 

agaaagtata 

acaaagatgg 

ccaagctctc 

aatcgaccat 

tgggcggtaa 



gctccctgag 
tgtcgaaaag 
tgttcacttg 
tgtcatccat 
caggtcccgg 
tccaggattt 
gacaagtgat 
tattttccag 



gccttcagcg 
aagaagatac 
gggctttacg 
tgggtaagtg 
ttgaagcggt 
gtgaagctac 
gagtacaaga 
attgttttaa 



gtggatgggt 
ctttctctgg 
atgatgttct 
tggaccggca 
tgctttctaa 
atactcggca 
gtgctgttat 
gc 



tgggttcttt 
tgctccccac 
tgtcttcgac 
tgcatccacc 
agttcacaat 
gtttggtact 
gcaggctaag 



<210> 12 

<211> 150 

<212> PRT 

<213> Triticum aestivum 



tcctatgata 60 
gatgatagga 120 
aatgttgaga 180 
gaggaagcat 240 
gcaaatgtcc 300 
cccttgaaca 360 
gagcatattc 420 
452 



<400> 12 

Thr Ser Gin Leu Pro Glu Ala Phe Ser Gly Gly 
1 5 10 

Ser Tyr Asp Thr Val Arg Tyr Val Glu Lys Lys 
20 25 

Gly Ala Pro His Asp Asp Arg Asn Leu Pro Asp 
35 40 

Tyr Asp Asp Val Leu Val Phe Asp Asn Val Glu 
50 55 



Trp Val Gly Phe Phe 
15 

Lys lie Pro Phe Ser 
30 

Val His Leu Gly Leu 
45 

Lys Lys Val Tyr Val 
60 



lie His Trp Val Ser Val Asp Arg His Ala Ser 
65 70 75 



Thr Glu Glu Ala Tyr 
80 



Lys Asp Gly Arg Ser Arg Leu Lys Arg Leu Leu 
85 90 



Ser Lys Val His Asn 
95 



Ala Asn Val Pro Lys Leu Ser Pro Gly Phe Val 
100 105 



Lys Leu His Thr Arg 
110" 



Gin Phe Gly Thr Pro Leu Asn Lys Ser Thr Met 

115 120 

Lys Ser Ala Val Met Gin Ala Lys Glu His He 
130 135 



Thr Ser Asp Glu Tyr 
125 

Leu Gly Gly Asn He 
140 



Phe Gin He Val Leu Ser 
145 150 

<210> 13 

<211> 504 

<212> DNA 

<213> Triticum aestivum 



<400> 13 

cccaaacagt ggtggcttag gagggatatc atttgatggt gacatgctta 
tctccgcacc attgtgtttt caacagctcc aagccccaat aggatgtact 
ctcagatagg ccccgagagt gggttgctca tcttcaggct ggtgcgggca 
tagtatccca gacgatgagc aaaaagaatt tgagaataag gcggctgccc 
aattgatctt gcagagtcgg cttttttaga caaagaatag agtgtctatt 
ttttagttgt tcatcatttt tcacccagtt cattttggaa agttgttcat 
ccgagttcat attggggaaa aaaagcaata ccgttttgtt gtcctttgaa 
ttgagctata ataagatgta ttttgctcat cgggcaaaaa aaaaaaaaaa 
aaaaaaaaaa aaaaaaaaaa aata 



tcgctcttgc 60 
catacaaaag 120 
ttgttgctga 180 
tagctcgggc 240 
aaattatttt 300 
cgttttttca 360 
atgaataaat 420 
aatataaaaa 480 
504 



10 
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BNSOOCIO: <WO 9949058A2J^ 



wo 99/49058 



PCT/US99/06046 



<210> 14 

<211> 92 

<212> PRT 

<213> Triticum aestivum 

<400> 14 

Pro Asn Ser Gly Gly Leu Gly Gly He Ser Phe Asp Gly Asp Met Leu 

1 5 10 15 

He Ala Leu Ala Leu Arg Thr He Val Phe Ser Thr Ala Pro Ser Pro 
20 25 30 

Asn Arg Met Tyr Ser Tyr Lys Ser Ser Asp Arg Pro Arg Glu Trp Val 
35 40 45 

Ala His Leu Gin Ala Gly Ala Gly lie Val Ala Asp Ser He Pro Asp 
50 55 60 

Asp Glu Gin Lys Glu Phe Glu Asn Lys Ala Ala Ala Leu Ala Arg Ala 
65 70 75 80 

He Asp Leu Ala Glu Ser Ala Phe Leu Asp Lys Glu 
85 90 



<210> 


15 






<211> 


535 






<212> 


DNA 






<213> 


Triticum aestivum 


<220> 








<221> 


unsure 




<222> 


(483) 




<220> 








<221> 


unsure 




<222> 


(511} 




<220> 








<221> 


unsure 




<222> 


(5201 . . (521) 




<400> 


15 






ctgctttggg 


attcaactcc 


caccaccacc 


catcccatct 


cgtatatata 


cgcgccctcc 


agagcagtgt 


cagaccgttg 


ttgagcagga 


tccctagccg 


ccgccacgtt 


ctcgccctcg 


gctgcggcag 


cgggcagatc 


gagggcggtg 


ggcctgaggt 


gctcgtcggc 


gagcgcgacc 


gcggaggagg 


aggacaggag 


gcgcttcttt 


aacctggtgc 


ccgtgtggga 


gtgcatcgtg 


cgntgcctcg 


ttcccgaagg 


acgacattgg 


<210> 


16 






<211> 


108 






<212> 


PRT 






<213> 


Triticum aestivum 


<220> 








<221> 


UNSURE 




<222> 


(102) 




<400> 


16 






Met Glu Ser Leu Ala Ala Ala Thr 


1 




5 





acctctaaac 
caatctcttt 
agcggaagga 
cgcctcgctg 
gcggcaacga 
ccggtgatca 
gaggcggcgg 
tcggattacc 
ncaaccccan 



tcaaccggat 
ctccgagcgc 
cagttcaatc 
cccaccccgc 
ggaggaggag 
atgggagcgc 
cacgggggac 
tcacccccgt 
nttcctcttc 



tgcagcttct 
gaaatctgac 
cgcaatggaa 
cccgcccgcg 
gagcagcagc 
cgccgcgaag 
cggcaagggc 
gctcgcctaa 
caagg 



60 
120 
180 
240 
300 
360 
420 
480 
535 



10 



15 
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His Pro Ala Pro Pro Ala Ala Ala 
20 



Ala Ala Thr Arg Arg Arg Arg Ser 
35 40 

Ala Ser Ala Thr Pro Vai He Asn 
50 55 

Glu Glu Asp Arg Arg Arg Phe Phe 
65 70 

Lys Giy Asn Leu Val Pro Val Trp 
85 



Thr Pro Val Leu- Ala Xaa Arg Cys 
100 



Ala Ala Giy Arg Ser Arg Ala Vai 
25 30 

Ser Ser Giy Leu Arg Cys Ser Ser 
45 

Giy Ser Ala Ala Ala Lys Ala Glu 
60 

Glu Ala Ala Ala Arg Giy Thr Giy 
75 80 

Glu Cys He Val Ser Asp Tyr Leu 
90 95 

Leu Val Pro Glu 
105 



<210> 17 

<211> 797 

<212> DNA 

<213> Zea mays 

<400> 17 

gcacgaggaa attaagaaga tttctcctag 
tcctcaagat tcaggaatat ccttgcaaac 
gtttggggtt tgcatgggtt tacagtgcat 
atccccttac ggcgttgtgc acggaaaggg 
tggaactctg ttttatgata ttcctaaccc 
aatcgagaag gatagtttcc cgcatgatac 
gttgatcatg gctgctcgcc acaggatata 
ggagagcatc ataactactg aagggaggct 
aggctacgaa gcctcgaatt gctctccgtg 
gcgtggggcg ggtgacaata agatgatgta 
aaactcagat gtatatgttc ggatgtcaca 
acatggtcgc atgtggcgtg tcgatgtcaa 
cgctatattt gtttttttac agtagttacc 
aaaaaaaaaa aaaaaaa 



aggaatactt atttccccag gccctggcac 60 
agttacagag cttgggccct caataccgtt 120 
tggagaggca tttggaggga aggttgttcg 180 
ctcccttgtt cattatgatg agaaacttga 240 
attccaggct ggaaggtatc acagccttgt 300 
cctggaaatt gttgcgtgga cagatgatgg 360 
caaacatatt cagggcgtgc agttccatcc 420 
catggtcaac aattttatca agattattga 480 
atttatctag ttggaagcag cagcagccct 540 
aaagcttgta actacatagc tcgaacatgt 600 
aactcacgac actcagagtc caactactgt 660 
atcaaatcat tgaactttta gctcggtgac 720 
actatatttg ttcatttggc aaaaaaaaaa 780 

797 



<210> 18 

<211> 169 

<212> PRT 

<213> Zea mays 



<400> 18 
His Glu Glu lie 
1 

Giy Pro Giy Thr 
20 



Glu Leu Giy Pro 
35 

Cys He Giy Glu 
50 

Vai Val His Giy 
65 

Giy Thr Leu Phe 



His Ser Leu Val 
100 



Lys Lys lie Ser 
5 

Pro Gin Asp Ser 

Ser lie Pro Leu 
40 

Ala Phe Giy Giy 
55 

Lys Giy Ser Leu 
70 

Tyr Asp lie Pro 
85 

He Giu Lys Asp 



Pro Arg Giy He 
10 

Giy He Ser Leu 
25 

Phe Giy Val Cys 



Lys Val Vai Arg 
60 



Vai His Tyr Asp 
75 

Asn Pro Phe Gin 
90 

Ser Phe Pro His 
105 



Leu He Ser Pro 
15 

Gin Thr Val Thr 
30 

Met Giy Leu Gin 
45 

Ser Pro Tyr Giy 



Giu Lys Leu Asp 
80 

Ala Giy Arg Tyr 
95 

Asp Thr Leu Giu 
110 
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He Val Ala Trp Thr Asp Asp Gly Leu He Met Ala Ala Arg His Arg 
115 120 125 

He Tyr Lys His He Gin Gly Val Gin Phe His Pro Glu Ser He He 
130 135 140 

Thr Thr Glu Gly Arg Leu Met Val Asn Asn Phe He Lys He He Glu 
145 150 155 160 

Gly Tyr Glu Ala Ser Asn Cys Ser Pro 
165 



<210> 
<211> 
<212> 
<213> 

<220> 
<221> 
<222> 



19 

1133 
DNA 

Zea mays 



unsure 
(13) 



<400> 19 

ctaccggttc 

ccgcagcggc 

cgacgcctcg 

tggtcgccgc 

caacggagaa 

gccagtatat 

tagaagatgt 

cacaagattc 

ttggagtttg 

ctccttctgg 

aggcattgtt 

ttgagcaaga 

ttatcatggc 

agagcatcat 

aactggagaa 

cagacggaag 

gcatagctag 

tgtgcggacc 

tggtgctcta 



cgnccagatg 
ggctcgttcc 
tttcgcgagc 
cgcggggccg 
gcagcccatc 
gggggagctt 
gagaaggaag 
gggaatatca 
catgggtctg 
agtgatgcat 
caatggcttg 
aaccttccca 
tgctcgccac 
cacccctgaa 
gcagcgttcg 
cagagacaag 
gaaacagctt 
gaatttagaa 
tagatgaata 



gcctgctccc 
ccggcgcatt 
gctggcttgt 
gccgcggcgg 
atcgtcatcg 
ggattgaact 
aacccaaggg 
ttgcagactg 
caatgcattg 
gggaaaagct 
ccaaaccctt 
catgatgctt 
aagaagtaca 
ggcaagaaaa 
taggggaggt 
gcgcttaaag 
tccctctctt 
taaagtacag 
tatccggtga 



acatcgccgc 
ctcccgcttc 
cggttaaggg 
caccggtggc 
acaactacga 
tcgaagtata 
gaatacttat 
ttcttgaatt 
gagaggcatt 
ctccagttta 
ttactgccgc 
tggaggctac 
aacacatcca 
tcatcctcaa 
agacgcgaca 
ctgcgccagg 
aattcgttgt 
gcgtccgtct 
ttttcggcaa 



cgcgggggtc 
cgccttcgcg 
aaacggagcg 
cgacctggac 
cagcttcaca 
ccgcaatgat 
ttctccagga 
aggcccaacc 
tggaggaaag 
ttacgatgag 
gaggtaccac 
tgcatggact 
gggtgtccaa 
ctttgtcaga 
ggtggtttca 
ttccggggct 
gttcgtggtg 
gatcaaataa 
aaaaaaaaaa 



tcctcccccg 
cgcctccggt 
gcgttcccgt 
ggccgcccgg 
tacaacctct 
gaactgacca 
cctggtgaac 
atcccaattt 
attatccgtg 
gaattaggaa 
agcttggtca 
gaagatggac 
ttccacccgg 
ttcattgagg 
tttcatagat 
ggcagttgaa 
acataatctg 
gcaccggtgt 
aaa 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1133 



<210> 20 

<211> 281 

<212> PRT 

<213> Zea mays 

<400> 20 

Met Ala Cys Ser His He Ala Ala Ala Gly Val Ser Ser Pro Ala Ala 
1 5 10 -15 

Ala Ala Ala Arg Ser Pro Ala His Ser Pro Ala Ser Ala Phe Ala Arg 
20 25 30 

Leu Arg Ser Thr Pro Arg Phe Ala Ser Ala Gly Leu Ser Val Lys Gly 
35 40 45 

Asn Gly Ala Ala Phe Pro Leu Val Ala Ala Ala Gly Pro Ala Ala Ala 
50 55 60 

Ala Pro Val Ala Asp Leu Asp Gly Arg Pro Ala Thr Glu Lys Gin Pro 
65 70 75 80 

He He Val He Asp Asn Tyr Asp Ser Phe Thr Tyr Asn Leu Cys Gin 
85 90 95 
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Tyr Met Gly Glu Leu Giy Leu Asn Phe Glu Val Tyr Arg Asn Asp Glu 
100 105 110 

Leu Thr lie Glu Asp Val Arg Arg Lys Asn Pro Arg Gly He Leu He 
115 120 125 

Ser Pro Gly Pro Gly Glu Pro Gin Asp Ser Gly He Ser Leu Gin Thr 
130 135 140 

Val Leu Glu Leu Gly Pro Thr He Pro He Phe Gly Val Cys Met Gly 
145 150 155 160 

Leu Gin Cys He Gly Glu Ala Phe Gly Gly Lys He He Arg Ala Pro 
165 170 175 

Ser Gly Val Met His Gly Lys Ser Ser Pro Val Tyr Tyr Asp Glu Glu 
180 185 190 

Leu Gly Lys Ala Leu Phe Asn Gly Leu Pro Asn Pro Phe Thr Ala Ala 
195 200 205 

Arg Tyr His Ser Leu Val He Glu Gin Glu Thr Phe Pro His Asp Ala 
210 215 220 

Leu Glu Ala Thr Ala Trp Thr Glu Asp Gly Leu He Met Ala Ala Arg 
225 230 235 240 

His Lys Lys Tyr Lys His He Gin Gly Val Gin Phe His Pro Glu Ser 
245 250 255 

He He Thr Pro Glu Gly Lys Lys He He Leu Asn Phe Val Arg Phe 
260 265 270 

He Glu Glu Leu Glu Lys Gin Arg Ser 
275 280 



<210> 


21 


<211> 


856 


<212> 


DNA 


<213> 


Oryza : 


<220> 




<221> 


unsure 


<222> 


(811) 


<220> 




<221> 


unsure 


<222> 


(816) . 


<220> 




<221> 


unsure 


<222> 


(831) 


<220> 




<221> 


unsure 


<222> 


(836) 


<400> 


21 



(816) 



ttgccgttca gtcacagcaa aagaagccaa aaacttgatc aaatttctga gcaactccgc 60 
cactcttctc ctccgatggc caccgccgcg cggctcctcc ccaagatcca gtcccccgcc 120 
tccccggccg tcgcggaggc gcggaggcgc gcccctccag tctccgatta ggagttacta 180 
gtggacccgc aagaactctg aagcaaaagc ttgttgctaa gagtgctgtt tctgtggtgg 240 
aaggtgaaaa cgcatttgat ggagtaaagc aagatactag accaatcata gttatagata 300 
actacgatag cttcacgtat aatttatgcc agtacatggg tgaggtggga gctaactttg 360 
aggtgtaccg caatgattga tatcaccgtg gaagaaatta agaagatttc tcctagagga 420 
atactcatct cccctggccc tggcacacct caagattcag gaatatcatt gcaaacagtt 480 

14 
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caagatcttg gaccttctac acctttgttt ggggtttgca tgggtttgca gtgtattggg 540 

gaggcatttg gagggaaggt tgttcgttct ccttatggag ttgtgcatgg gaaaggatcc 600 

cttgttcact atgaggagaa acttgatgga acactgtttt ctggtctccc aaacccattc 660 

caagcgggaa gataccacag ccttgtaatt gagaaggata gcttcccaca tgatgccctg 720 

gaaattactg cttggacaga cgatgggctg atcatggctg ctcgcacaag gaagtccaaa 780 

caatatacag ggtggtgcag tccatccaag ngagcnatca taacaccaga ngggangctc 840 
aatggtccaa gaattt 

<210> 22 

<211> 247 

<212> PRT 

<213> Oryza sativa 

<400> 22 

Ala Thr Pro Pro Leu Phe Ser Ser Asp Gly His Arg Arg Ala Ala Pro 
15 10 15 

Pro Gin Asp Pro Vai Pro Arg Leu Pro Gly Arg Arg Gly Gly Ala Glu 
20 25 30 

Ala Arg Pro Ser Ser Leu Arg Leu Gly Val Thr Ser Gly Pro Ala Arg 
35 40 45 

Thr Leu Lys Gin Lys Leu Val Ala Lys Ser Ala Val Ser Val Val Glu 
50 55 60 

Gly Glu Asn Ala Phe Asp Gly Val Lys Gin Asp Thr Arg Pro He He 
65 70 75 80 

Val He Asp Asn Tyr Asp Ser Phe Thr Tyr Asn Leu Cys Gin Tyr Met 
85 90 95 

Gly Glu Val Gly Ala Asn Phe Glu Val Tyr Arg Asn Asp Asp He Thr 
100 105 110 

Val Glu Glu He Lys Lys He Ser Pro Arg Gly He Leu He Ser Pro 
115 120 125 

Gly Pro Gly Thr Pro Gin Asp Ser Gly He Ser Leu Gin Thr Val Gin 
130 135 140 

Asp Leu Gly Pro Ser Thr Pro Leu Phe Gly Val Cys Met Gly Leu Gin 
145 150 155 160 

Cys He Gly Glu Ala Phe Gly Gly Lys Val Val Arg Ser Pro Tyr Gly 
165 170 175 

Val Val His Gly Lys Gly Ser Leu Val His Tyr Glu Glu Lys Leu Asp 
180 185 190 

Gly Thr Leu Phe Ser Gly Leu Pro Asn Pro Phe Gin Ala Gly Arg Tyr 
195 200 205 

His Ser Leu Val He Glu Lys Asp Ser Phe Pro His Asp Ala Leu Glu 
210 215 220 

He Thr Ala Trp Thr Asp Asp Gly Leu He Met Ala Ala Arg Thr Arg 
225 230 235 240 

Lys Ser Lys Gin Tyr Thr Gly 
245 

<210> 23 
<211> 468 
<212> DNA 

<213> Triticum aestivum 

15 
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<220> 

<221> unsure 
<222> (7) 



<400> 23 

ggaaccnttc 

ggctgctcgc 

atcatcaccc 

gagaagcaga 

gagactattg 

ggaaatgaat 

tcattgtgga 

ctttctcaaa 



ccgcacgttg 
ccacaagaag 
ctgaaggcaa 
cctccgaggg 
ttgagagatc 
aagggaaaaa 
tctagatccc 
taataaatcg 



aactggaagt 
tacacgcata 
gaaaatcatc 
gaagtagagg 
aggcaggcag 
gagcttttcc 
gtatgaactt 
gtggaagcaa 



cacggcatgg 
tccagggtgt 
ctcaactttg 
attcaggaag 
gaaggggcct 
attgccattc 
cattcggtca 
aaaaaaaaaa 



actgaagatg 
tcagttccca 
cgagatatgt 
cagagcagtt 
aaaagccaag 
gtttggttca 
attatgttcg 
aaaactcg 



gacttgtcat 60 
cccagagagc 120 
cgaggagttt 180 
agaggaagta 240 
cagatttagg 300 
ttgtaatata 360 
tcgtggtgct 420 
468 



<210> 24 

<211> 67 

<212> PRT 

<213> Triticum aestivum 

<400> 24 

Glu Pro Phe Pro His Val Glu Leu Glu Val Thr Ala Trp Thr Glu Asp 
1 5 10 . 15 

Gly Leu Val Met Ala Ala Arg His Lys Lys Tyr Thr His He Gin Gly 
20 25 30 

Val Gin Phe His Pro Glu Ser He He Thr Pro Glu Gly Lys Lys He 
35 40 45 

He Leu Asn Phe Ala Arg Tyr Val Glu Glu Phe Glu Lys Gin Thr Ser 
50 55 60 

Glu Gly Lys 
65 

<210> 25 
<211> 1181 
<212> DNA 
<213> Zea mays 



<400> 25 

gcacgagctg 

aggccgccgc 

cgactggtag 

cggcggccgc 

ggcagcaggg 

ccacagcaaa 

tgccttactc 

tggcaaaggg 

tgtcctgccc 

acttcatgag 

tggaagagac 

tgacaacacc 

ttatttatct 

tgcaatctct 

gtgtgtcgac 

tcggtagcgc 

agctagaaga 

agcaaagaaa 

gtgtagtgaa 

taaaaaaaaa 



aatccaccac 
cgcggggtcg 
ggtctcgttc 
cgctgtggcg 
gaagactgcc 
agcactcagg 
cgatccattg 
caccacattt 
tgtagcgctt 
tattgttaaa 
agatgtcttg 
gactacacca 
tgtaagcact 
tcttcaggat 
tccagagcat 
agtgatgaag 
gttcgccaag 
ttaagatttt 
aataattgtg 
aaaaaaaaac 



ccacccacac 
gcgtcgttct 
cggagcgccg 
gaggacaagc 
ttgattccct 
atccttgatg 
gctgacggcc 
gaggatgtca 
ttcacatatt 
gaagctgggg 
aggagtgagg 
aatgaaagaa 
gttggagtta 
atcaagaatg 
gtccggcaga 
acgttggagg 
aacctgaagg 
tttttccttt 
tccctatgtt 
tcgagggggg 



gacaaaaatc 
ccgccgcggg 
cgcccgtggt 
gcagcatctc 
tcatcactgc 
cgtgtggttc 
ctgttattca 
tctccatggt 
acaacccgat 
tacacggtct 
ctgccaaaaa 
tggagaaaat 
ctggtacacc 
tgacagaaaa 
ttgcaggatg 
aagctgcttc 
ccgcattgcc 
caatcttggc 
gaatgctcaa 
gcccgtaccc 



cgccatggcc 
gccgaggcgg 
cgccgtcagg 
cggcaccttc 
tggagaccct 
agacgtgatc 
ggcctccgcg 
aaagggggtg 
cctgaagcgt 
tgtggtacca 
caacctagag 
tgcacaggct 
tgcaaatgta 
acccgtggct 
gggcgcagat 
tccagaagaa 
gtgaaccaac 
tttcagggtt 
atgaataaat 
a 



ttcgcgctca 
cgcgccgcgg 
gccgcggcgg 
gccgagctca 
gacttggcca 
gaactgggtg 
acgcgcgctc 
atacctgatc 
ggtgtcccca 
gatgttcctt 
ctggtgctat 
tcacaaggat 
agcggcaagg 
gttggctttg 
ggtgtgatta 
ggattgaaga 
cattttcgtt 
gttggttgtt 
caagatccca 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1181 



<210> 
<211> 



26 
319 
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SUBSTITUTE SHEET (RULE 26) 



BNSOOCIO: <WO 994905eA2J_> 



wo 99/49058 



PCT/LIS99/06046 



<212> PRT 
<213> Zea mays 

<400> 26 

Met Ala Phe Ala Leu Lys Ala Ala Ala Ala Gly Ser Ala Ser Phe Ser 
15 10 15 

Ala Ala Gly Pro Arg Arg Arg Ala Ala Ala Thr Gly Arg Val Ser Phe 
20 25 30 

Arg Ser Ala Ala Pro Val Val Ala Val Arg Ala Ala Ala Ala Ala Ala 
35 40 45 

Ala Ala Val Ala Glu Asp Lys Arg Ser lie Ser Gly Thr Phe Ala Glu 
50 55 60 

Leu Arg Gin Gin Gly Lys Thr Ala Leu lie Pro Phe He Thr Ala Gly 
65 70 75 80 

Asp Pro Asp Leu Ala Thr Thr Ala Lys Ala Leu Arg He Leu Asp Ala 
85 90 95 

Cys Gly Ser Asp Val He Glu Leu Gly Val Pro Tyr Ser Asp Pro Leu 
100 105 110 

Ala Asp Gly Pro Val He Gin Ala Ser Ala Thr Arg Ala Leu Ala Lys 
115 120 125 

Gly Thr Thr Phe Glu Asp Val He Ser Met Val Lys Gly Val He Pro 
130 135 140 

Asp Leu Ser Cys Pro Val Ala Leu Phe Thr Tyr Tyr Asn Pro He Leu 
145 150 155 160 

Lys Arg Gly Val Pro Asn Phe Met Ser He Val Lys Glu Ala Gly Val 
165 170 175 

His Gly Leu Val Val Pro Asp Val Pro Leu Glu Glu Thr Asp Val Leu 
180 185 190 

Arg Ser Glu Ala Ala Lys Asn Asn Leu Glu Leu Val Leu Leu Thr Thr 
195 200 205 

Pro Thr Thr Pro Asn Glu Arg Met Glu Lys He Ala Gin Ala Ser Gin 
210 215 220 

Gly Phe lie Tyr Leu Val Ser Thr Val Gly Val Thr Gly Thr Pro Ala 
225 230 235 240 

Asn Val Ser Gly Lys Val Gin Ser Leu Leu Gin Asp He Lys Asn Val 
245 250 255 

Thr Glu Lys Pro Val Ala Val Gly Phe Gly Val Ser Thr Pro Glu His 
260 265 270 

Val Arg Gin He Ala Gly Trp Gly Ala Asp Gly Val He He Gly Ser 
275 280 285 

Ala Val Met Lys Thr Leu Glu Glu Ala Ala Ser Pro Glu Glu Gly Leu 
290 295 300 

Lys Lys Leu Glu Glu Phe Ala Lys Asn Leu Lys Ala Ala Leu Pro 
305 310 315 

<210> 27 
<211> 1243 

17 



SUBSTITUTE SHEET (RULE 26) 

BNSDOCID: <WO 9949058A2J_> 



wo 99/49058 



PCT/US99/06046 



<212> DNA 
<213> Zea mays 



<400> 27 

gcacgagcgc 

cgccactcgg 

ccacaaaggc 

ccatggccaa 

tcagggagca 

ttcatcccgt 

attctcaatt 

gctgatgggc 

gattctgtca 

ttcacatact 

caagctggca 

aggagcgagg 

acagatcgga 

gtaggggtta 

agcaagaagg 

gtgaaacaga 

cagttgtgtg 

agtatgaagg 

aacatggttt 

acgggtcgtt 

taaactgagt 



aagcggatcg 
ttcaccgccc 
agcgctcgga 
cggcggcgct 
aggcaagggt 
tcataactgc 
cctgcggttc 
cagttattca 
tagagatgct 
ataatccaat 
tacatggact 
ccattatgca 
tgaagggaat 
caggtgcacg 
ttacagacaa 
tcgtgggctg 
aggctgctac 
ctgctatgcc 
tagcaatatt 
ctccggtggt 
tagtgcgaaa 



caaactcgca 
gccgccgccg 
ggtggccctc 
gcggccggca 
tggattgctg 
tggtgatcct 
ggatgttata 
ggcttctgca 
gaagggggtt 
tctgaaacgt 
tgtagtgcct 
caacatagag 
tgcacaagct 
ctcaaatgta 
accggtagct 
gggagcggac 
tcctgaagag 
gtgagctgat 
ctgggcatgg 
gagaacgaag 
aaaaagataa 



atcaaggctg 
agacctcctc 
tgtcgggtgg 
agctcaccgt 
gcgccgtggt 
gacctggtaa 
gaggttggcg 
acacgggcgc 
acacctgagc 
ggtgtgggaa 
gatcttcctt 
ctggtgctgc 
tcagaaggat 
aacttacgtg 
gttggtttcg 
ggtgtgattg 
ggtctggaaa 
ctcactcccc 
ggttggcatc 
tctggctttg 
gccttaaaaa 



ctcccctccg 
ctcgccggct 
agtggtggac 
cgccgagacc 
ggccgatacc 
coacatcgaa 
taccatactc 
ttaagaaagg 
tttcttgccc 
acttcatgtc 
tggaggagac 
ttacaacgcc 
ttctctatct 
tagaacacct 
gtgtatcgac 
ttggcagtgc 
ggctcgagga 
gtcacaggca 
cataactcat 
cctataatag 
aaa 



tctccgtgtt 
actcctgacc 
acatccgcgc 
ttctccaacc 
caatagtgca 
agcattgaag 
ggatccactg 
cactacactt 
cattgttctt 
tactatcaaa 
agcgttgctg 
aaccacgccg 
tgtgagcgct 
tcttagggag 
tccagagcat 
gattgtgaag 
gtacgccagg 
aggtaattag 
ctagtgctgt 
aaaagggtta 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1243 



<210> 28 

<211> 347 

<212> PRT 

<213> Zea mays 

<400> 28 

Ala Arg Ala Gin Ala Asp Arg Lys Leu Ala He Lys Ala Ala Pro Leu 
15 10 15 

Arg Leu Arg Val Arg His Ser Val His Arg Pro Pro Pro Pro Arg Pro 
20 25 30 

Pro Pro Arg Arg Leu Leu Leu Thr Pro Gin Arg Gin Arg Ser Glu Val 
35 40 45 

Ala Leu Cys Arg Val Glu Trp Trp Thr His Pro Arg Pro Trp Pro Thr 
50 55 60 

Ala Ala Leu Arg Pro Ala Ser Ser Pro Ser Pro Arg Pro Ser Pro Thr 
65 70 75 80 

Ser Gly Ser Lys Ala Arg Val Gly Leu Leu Ala Pro Trp Trp Pro He 
85 90 95 

Pro Asn Ser Ala Phe He Pro Phe He Thr Ala Gly Asp Pro Asp Leu 
100 105 110 

Val Thr Thr Ser Lys Ala Leu Lys He Leu Asn Ser Cys Gly Ser Asp 
115 120 125 

Val He Glu Val Gly Val Pro Tyr Ser Asp Pro Leu Ala Asp Gly Pro 
130 135 140 

Val He Gin Ala Ser Ala Thr Arg Ala Leu Lys Lys Gly Thr Thr Leu 
145 150 155 160 

Asp Ser Val He Glu Met Leu Lys Gly Val Thr Pro Glu Leu Ser Cys 
165 170 175 
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SUBSTITUTE SHEET (RULE 26) 

BNSOOCIO: <WO 9949058A2J_> 



wo 99/49058 



PCT/LIS99/06046 



Pro lie Vai Leu Phe Thr Tyr Tyr Asn 
180 185 



Pro lie Leu Lys Arg Gly Val 
190 



Gly Asn Phe Met Ser Thr lie 
195 

Val Pro Asp Leu Pro Leu Glu 

210 215 

lie Met His Asn He Glu Leu 
225 230 

Thr Asp Arg Met Lys Gly He 
245 



Lys Gin Ala Gly lie His Gly Leu Val 
200 205 

Glu Thr Ala Leu Leu Arg Ser Glu Ala 
220 

Val Leu Leu Thr Thr Pro Thr Thr Pro 
235 240 

Ala Gin Ala Ser Glu Gly Phe Leu Tyr 
250 255 



Leu Vai Ser Ala Val Gly Val Thr Gly 
260 265 

Arg Vai Glu His Leu Leu Arg Glu Ser 
275 280 



Ala Pro Ser Asn Val Asn Leu 
270 

Lys Lys Val Thr Asp Lys Pro 
285 



Val Ala Val Gly Phe Gly Val 
290 295 



Ser Thr Pro Glu His Val Lys Gin He 
300 



Val Gly Trp Gly Ala Asp Gly 
305 310 



Val lie Vai Gly Ser Ala He Vai Lys 
315 320 



Gin Leu Cys Glu Ala Ala Thr 
325 



Pro Glu Glu Gly Leu Glu Arg Leu Glu 
330 335 



Glu Tyr Ala Arg Ser Met Lys Ala Ala 
340 345 



Met Pro 



<210> 29 

<211> 831 

<212> DNA 

<213> Oryza sativa 



<400> 29 

tcctgccctg 

ttcatgactg 

gaagagacaa 

acaacaccaa 

atttatcttg 

caatctcttc 

tatatcgact 

tgggagcgct 

gctagaagag 

ggaaagaaaa 

acttttgcta 

gttagttgct 

taacactgtt 

ataatgtcgt 



tagcgctttt 
ttgtaaaaga 
atattttgag 
ctacaccaac 
taagcactgt 
tccaggatat 
ccagagcatg 
atggtgaggc 
ctagccaaga 
tacagtccat 
ttggggtttt 
tgaacgaata 
ttcgctttgt 
aattcaccta. 



cacatattat 
agccggtgtc 
gagtgaagct 
agaaagaatg 
tggagttaca 
caagcaggtc 
tgaagcagat 
aattgggcga 
gcctgaaggc 
ttttttgtag 
cggggttact 
aatcaaggta 
cgttggtgca 
aaaaaaaaaa 



aacccgattc 
catggtcttg 
gctaagaaca 
gagaaaatta 
ggtgcacgtg 
acggacaaag 
tgcgggatgg 
agctgcttca 
tgcattgccc 
tcaggtttga 
gttgtctgtt 
agaatgatac 
aatatcgaag 
aaaaaaaaaa 



tgaagcgtgg 
tggtacctga 
accttgagct 
caaaagcttc 
caaatgtcag 
cttgtggctg 
ggcgcagatg 
cccgaagaag 
tgaaacaatc 
agcatagtat 
gaaaataatt 
taaatgttgc 
agcggatata 
aaaaaaaaaa 



catcgcaaac 60 
tgttcctctg 120 
ggtgctactg 180 
tgaaggattt 240 
tggcaaggtg 300 
ttgggttcgg 360 
gtgtgatcat 420 
gattgaagaa 480 
agttccatga 540 
tgggagcatg 600 
aagtcattgt 660 
gaaaagggcc 720 
aacggagaaa 780 
a 831 



<210> 30 

<211> 179 

<212> PRT 

<213> Oryza sativa 

<400> 30 

Ser Cys Pro Vai Ala Leu Phe Thr Tyr Tyr Asn Pro He Leu Lys Arg 
1 5 10 15 

Gly He Ala Asn Phe Met Thr Val Val Lys Glu Ala Gly Val His Gly 
20 25 30 
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SUBSTITUTE SHEET (RULE 26) 



wo 99/49058 



PCT/US99/06046 



Leu Val Val Pro Asp Vai Pro Leu Glu Glu Thr Asn lie Leu Arg Ser 
35 40 45 

Glu Ala Ala Lys Asn Asn Leu Glu Leu Val Leu Leu Thr Thr Pro Thr 
50 55 60 

Thr Pro Thr Glu Arg Met Glu Lys He Thr Lys Ala Ser Glu Gly Phe 
65 70 75 80 

He Tyr Leu Val Ser Thr Val Gly Val Thr Gly Ala Arg Ala Asn Val 
85 90 95 

Ser Gly Lys Val Gin Ser Leu Leu Gin Asp He Lys Gin Val Thr Asp 
100 105 110 

Lys Ala Cys Gly Cys Trp Val Arg Tyr He Asp Ser Arg Ala Cys Glu 
115 120 125 

Ala Asp Cys Gly Met Gly Arg Arg Trp Cys Asp His Trp Glu Arg Tyr 
130 135 140 

Gly Glu Ala He Gly Arg Ser Cys Phe Thr Arg Arg Arg He Glu Glu 
145 150 155 160 

Ala Arg Arg Ala Ser Gin Glu Pro Glu Gly Cys He Ala Leu Lys Gin 
165 170 175 

Ser Val Pro 



<210> 31 

<211> 1226 

<212> DNA 

<213> Oryza sativa 



<400> 31 

cgcgcacgtg 

ttgccgtcgc 

gcggcggagg 

cgctcatgga 

cggagacctt 

ccagtgaccc 

cagatgtgat 

aggctgcagc 

tcaagggggt 

ttttaaagcg 

ttgtagtacc 

atggcataga 

ttgcaaaagc 

gctcaaatgt 

aacctgttgc 

ggggagcaga 

cacctgaaga 

cttgagatga 

ttttttttgt 

gaatcatgac 

agaaggtagc 



ctcatcgcga 
cgccgcggcg 
cggggcgggg 
gatggaagat 
ctccaggctc 
tgacttggca 
tgagctgggt 
aacgcgtgct 
tatacctgag 
cggagtgagt 
tgatcttcct 
gctggtgctg 
ttcagaagga 
aaatttgcgt 
tgttggcttt 
tggtgtgatc 
agggttgaaa 
gcggttccat 
ctgattaaat 
tgtatctcac 
tcagaaaaaa 



ggccacccgc 
gcgtctctct 
ctgggcctgt 
agcggtcgcg 
cgggagcagg 
accacatcga 
gtaccttact 
cttaagaaag 
ttgtcttgtc 
aacttcatgg 
ttggaagaga 
cttacaacac 
tttatttatc 
gttgagtacc 
ggcatatcga 
atcggcagcg 
agagtagaag 
gcctccaccc 
tgcatagcat 
tctagtacta 
aaaaaa 



tccgccgcct 
cctccgcgtg 
cggccaggga 
gcgtcgtggg 
gcaagactgc 
aagcgttgaa 
cggatccgtt 
gcgctacatt 
caatagttat 
ctatcattaa 
cagcgctgtt 
caaccacacc 
ttgtaagttc 
ttcttcagga 
ctccagagca 
ctattgttag 
agtatgccaa 
atgcaagcca 
tttaatccag 
tccttctgtt 



ccatttcgtc 
ggctgctctg 
cagggagcca 
cgccgggaag 
attcattcca 
gatccttgat 
ggctgatggg 
tgattccgtc 
cttcacatac 
acaagccggc 
gaggaatgag 
aacagagagg 
tgttggagtt 
gatcaagaag 
tgtaaagcag 
acagttagga 
gaacatgaag 
acaagaaggc 
tattttcttg 
aacatacaga 



gccgtcgccg 
cttcgtttcc 
gggaggttgc 
cgcggcgtcg 
ttcatcaccg 
tcctgtggtt 
ccggttattc 
atagctatgc 
tacaacccaa 
gtacatggtc 
gctgtcatgc 
atgaaggaga 
acaggtgcac 
gttacagaca 
attgcaggct 
gaggctgctt 
gctgctatgc 
agtacatcgt 
tatgattcat 
tgaataataa 



60 
120 
ISO 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1226 



<210> 32 

<211> 340 

<212> PRT 

<213> Oryza sativa 

<400> 32 

Arg Thr Cys Ser Ser Arg Gly His Pro Leu Arg Arg Leu His Phe Vai 
15 10 15 



20 



SUBSTITUTE SHEET (RULE 26) 

BNSOOCIO: <W0 9949058A2J_> 



wo 99/49058 



PCT/US99/06046 



Ala Val Ala Val Ala Val Ala Ala Ala Ala Ala Ser Leu Ser Ser Ala 
20 25 30 

Trp Ala Ala Leu Leu Arg' Phe Arg Gly Gly Giy Gly Ala Gly Leu Gly 
35 40 45 

Leu Ser Ala Arg Asp Arg Glu Pro Gly Arg Leu Pro Leu Met Glu Met 
50 55 60 

Glu Asp Ser Gly Arg Gly Val Val Gly Ala Gly Lys Arg Gly Val Ala 
65 70 75 80 

Glu Thr Phe Ser Arg Leu Arg Glu Gin Gly Lys Thr Ala Phe He Pro 
85 90 95 

Phe He Thr Ala Ser Asp Pro Asp Leu Ala Thr Thr Ser Lys Ala Leu 
100 105 110 

Lys He Leu Asp Ser Cys Gly Ser Asp Val lie Glu Leu Gly Val Pro 
115 120 125 

Tyr Ser Asp Pro Leu Ala Asp Gly Pro Val He Gin Ala Ala Ala Thr 
130 135 140 

Arg Ala Leu Lys Lys Giy Ala Thr Phe Asp Ser Val He Ala Met Leu 
145 150 155 160 

Lys Gly Val lie Pro Glu Leu Ser Cys Pro He Val He Phe Thr Tyr 
165 170 175 

Tyr Asn Pro He Leu Lys Arg Gly Val Ser Asn Phe Met Ala He He 
180 185 190 

Lys Gin Ala Giy Val His Giy Leu Val Val Pro Asp Leu Pro Leu Glu 
195 200 205 

Glu Thr Ala Leu Leu Arg Asn Glu Ala Val Met His Gly He Glu Leu 
210 215 220 

Val Leu Leu Thr Thr Pro Thr Thr Pro Thr Glu Arg Met Lys Glu He 
225 230 235 240 

Ala Lys Ala Ser Glu Giy Phe He Tyr Leu Val Ser Ser Val Giy Val 
245 250 255 

Thr Gly Ala Arg Ser Asn Val Asn Leu Arg Val Glu Tyr Leu Leu Gin 
260 265 270 

Glu He Lys Lys Val Thr Asp Lys Pro Val Ala Val Gly Phe Gly He 
275 280 285 

Ser Thr Pro Glu His Val Lys Gin He Ala Gly Trp Gly Ala Asp Giy 
290 295 300 

Val He He Gly Ser Ala He Val Arg Gin Leu Gly Glu Ala Ala Ser 
305 310 315 320 

Pro Glu Glu Giy Leu Lys Arg Val Glu Glu Tyr Ala Lys Asn Met Lys 
325 330 335 

Ala Ala Met Pro 
340 

<210> 33 
<211> 533 



21 



SUBSTITUTE SHEET (RULE 26) 

BNSDOCID: <WO 9949058A2J_> 



wo 99/49058 



PCT/US99/06046 



<212> DNA 

<213> Glycine max 

<220> 

<221> unsure 
<222> (500) 

<400> 33 

gagagaagaa acccaataag tgattctcat ttttatggct cttgccctga agtcaagttg 60 
cttcttgcaa ttgaagaaac ctgaggctgg tttcaatgtt tgtttttcat ctaagaaagc 120 
tataatctcc gtgaaaagat acactccggt ggccgctatc aggatcatgg acgcagtggg 180 
actctctgca acattcacca gattgaaaaa agaaggaaaa gtggctttca tcccatatat 240 
cactgctggt gatcctgacc tttcaaccac tgcagaagca ctgaaattgc ttgattcatg 300 
cgggtctgac ataattgagc taggcattcc atactctgat cctttggcag atggtcctgt 360 
tatccaggct gctgctacaa gatctttagc aaaggggacc aattttaatg tgattattga 420 
tatgttgaag gaggttgttc cgcaactgtc atgtccaatt gcactgttta cataattaca 480 
atccaaatac ctggaaagcn gtgggtactg gagaaaattt atgtccaacc aat 533 

<210> 34 

<211> 146 

<212> PRT 

<213> Glycine max 

<400> 34 

Met Ala Leu Ala Leu Lys Ser Ser Cys Phe Leu Gin Leu Lys Lys Pro 
15 10 15 

Glu Ala Gly Phe Asn Val Cys Phe Ser Ser Lys Lys Ala He He Ser 
20 25 30 

Val Lys Arg Tyr Thr Pro Val Ala Ala He Arg lie Met Asp Ala Val 
35 40 45 

Gly Leu Ser Ala Thr Phe Thr Arg Leu Lys Lys Glu Gly Lys Val Ala 
50 55 60 

Phe He Pro Tyr He Thr Ala Gly Asp Pro Asp Leu Ser Thr Thr Ala 
65 70 75 80 

Glu Ala Leu Lys Leu Leu Asp Ser Cys Gly Ser Asp He He Glu Leu 
85 90 95 

Gly He Pro Tyr Ser Asp Pro Leu Ala Asp Gly Pro Val He Gin Ala 
100 105 110 

Ala Ala Thr Arg Ser Leu Ala Lys Gly Thr Asn Phe Asn Val He He 
115 120 125 

Asp Met Leu Lys Glu Val Val Pro Gin Leu Ser Cys Pro He Ala Leu 
130 135 140 

Phe Thr 
145 

<210> 35 

<211> 516 

<212> DNA 

<213> Triticum aestivum 
<220> 

<221> unsure 

<222> (2) 

<220> 

<221> unsure 
<222> (4) 

22 



SUBSTITUTE SHEET (RULE 26) 

BNSCXDCID: <WO ^9949058A2J_> 



wo 99/49058 PCT/US99/06046 



<220> 

<221> unsure 
<222> (68) 



<220> 

<221> unsure 
<222> (173) 



<220> 

<221> unsure 
<222> (198) 



<220> 

<221> unsure 

<222> (212) 

<220> 

<221> unsure 

<222> (221) 



<220> 

<221> unsure 
<222> (224) 



<220> 

<221> unsure 
<222> (235) 



<220> 

<221> unsure 

,<222> (368) 

<220> 

<221> unsure 

<222> (393) 

<220> 

<221> unsure 

<222> (414) 

<220> 

<221> unsure 

<222> (428) 



<220> 

<221> unsure 

<222> (437) 

<220> 

<221> unsure 

<222> (442) 

<220> 

<221> unsure 

<222> (461) 

<220> 

<221> unsure 

<222> (478) 



<220> 

<221> unsure 
<222> (483) 
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SUBSTITUTE SHEET (RULE 26) 

BNSDOCID: <WO 9949058A2J_> 



wo 99/49058 



PCT/US99/06046 



<220> 

<221> unsure 
<222> (493) 

<220> 

<221> unsure 
<222> (510) 

<400> 35 

angntaagcc ctccatcgtt tctctccaca gaacggattc aatgtctgga aatccagcaa 60 

gccacggnct cggcggggtc gctcgccgag gcgcccgtgc ctgtcgccgc cggcgaccgc 120 

ggcctgtccg tgtcgcaagc catgtcaaaa gtcatggaga agggcaagac ggncttaatc 180 

ccgtacatca acgccggnga ccccgacctg gngacgacgg nggnggcgct gaggntcctc 24 0 

gacgccctgg gcgcggacgt cgtcgagctc ggcatgccct tctcggatgc ctccgccgac 300 

ggggncgtca tcaaagactc cgcggcgcgc gcgctggccg ctggcgcgac ggctgacgcc 360 

atcatggnga tgctgaagga ggtganaccg canctttcct gccccgtggg gatnttctct 420 

tacttcanac ccattgnaca angagggatg gggagcttcc ncccccccgt caaagaancc 480 

ggnggtgaaa ggncttataa taacccgatn tttctt 516 

<210> 36 

<211> 159 

<212> PRT 

<213> Triticum aestivum 
<220> 

<221> UNSURE 

<222> (21) 

<220> 

<221> UNSURE 
<222> (56) 

<220> 

<221> UNSURE 
<222> (72).. (73) 

<220> 

<221> UNSURE 
<222> (77) 

<220> 

<221> UNSURE 
<222> (100) 

<220> 

<221> UNSURE 
<222> (121) 

<220> 

<221> UNSURE 
<222> (127) 

<220> 

<221> UNSURE 
<222> (129) 

<220> 

<221> UNSURE 
<222> (136) 

<220> 

<221> UNSURE 
<222> (141) 
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SUBSTITUTE SHEET (RULE 26) 



BNSDOCID: <W0 ^9949058A2_L> 



wo 99/49058 



PCT/US99/06046 



<220> 

<221> UNSURE 
<222> (144) 



<220> 

<221> UNSURE 
<222> (146) 



<220> 

<221> UNSURE 

<222> (152) 

<220> 

<221> UNSURE 

<222> (158) 



<400> 36 
Ser Pro Pro Ser 
1 

Gin Gin Ala Thr 
20 

Vai Ala Ala Gly 
35 

Vai Met Giu Lys 
50 

Asp Pro Asp Leu 
65 

Leu Gly Ala Asp 



Ala Asp Gly Xaa 
100 

Gly Ala Thr Ala 
115 

Xaa Leu Ser Cys 
130 

Gin Xaa Gly Met 
145 



Phe Leu Ser Thr 
5 

Xaa Ser Ala Gly 



Asp Arg Gly Leu 
40 

Gly Lys Thr Xaa 
55 

Xaa Thr Thr Xaa 
70 

Vai Vai Giu Leu 
85 

Vai lie Lys Asp 



Asp Ala He Met 
120 

Pro Vai Gly Xaa 
135 

Gly Ser Phe Xaa 
150 



Giu Arg lie Gin 
10 

Ser Leu Ala Giu 
25 

Ser Vai Ser Gin 



Leu He Pro Tyr 
60 



Xaa Ala Leu Arg 
75 

Gly Met Pro Phe 
90 

Ser Ala Ala Arg 
105 

Xaa Met Leu Lys 



Phe Ser Tyr Phe 
140 



Pro Pro Vai Lys 
155 



Cys Leu Giu He 
15 

Ala Pro Vai Pro 
30 

Ala Met Ser Lys 
45 

lie Asn Ala Gly 



Xaa Leu Asp Ala 
80 

Ser Asp Ala Ser 
95 

Ala Leu Ala Ala 
110 

Giu Vai Xaa Pro 
125 

Xaa Pro He Xaa 



Giu Xaa Gly 



<210> 37 

<211> 475 

<212> DNA 

<213> Triticum aestivum 



<400> 37 

gcacgagcgg acccctacat cgacgggccc 

gccagcggca ccaccatgga cgccgtgctg 

tcgtgccccg tggtgctcct ctcctactac 

atgaaagagg cgggggtcca cggtcttata 

tcgctgtgga gtgaagccaa gaacaacaac 

ataccagagg acaggatgac ggagatcacc 

agcgtgaacg gagtgacagg tcctcgcgca 

aagcgagtta agaaggtggc taacaagccc 



atcatccagg cgtcggtggc gcgggctctg 60 
gagatgctca gggaggtgac gccggagctg 120 
aagcccatca tgtctcgcag cttggccgag 180 
gtgcctgatc tcccgtacgt ggccgcgcac 240 
ctggagctgg tgctgctgac aacaccagcc 300 
aacgcttcaa aaggcttcgt ctacctgggg 360 
aacgtgaacc cgccagtggg ttcacccatc 420 
gttgccggtg ggcttcggga aatcc 475 



<210> 38 

<211> 158 

<212> PRT 

<213> Triticum aestivum 
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<400> 38 

Ala Arq Ala Asp Pro Tyr He Asp Gly Pro He He Gin Ala Ser Val 
15 10 15 

Ala Arq Ala Leu Ala Ser Gly Thr Thr Met Asp Ala Val Leu Glu Met 
20 25 30 

Leu Arq Glu Val Thr Pro Glu Leu Ser Cys Pro Val Val Leu Leu Ser 
35 40 . 45 

Tvr Tyr Lys Pro He Met Ser Arg Ser Leu Ala Glu Met Lys Glu Ala 
50 55 60 

Gly Val His Gly Leu He Val Pro Asp Leu Pro Tyr Val Ala Ala His 
65 70 75 80 

Ser Leu Trp Ser Glu Ala Lys Asn Asn Asn Leu Glu Leu Val Leu Leu 
85 90 95 

Thr Thr Pro Ala He Pro Glu Asp Arg Met Thr Glu He Thr Asn Ala 
100 105 110 

Ser Lys Giy Phe Val Tyr Leu Gly Ser Val Asn Gly Val Thr Gly Pro 
115 120 125 

Arg Ala Asn Val Asn Pro Pro Val Gly Ser Pro He Lys Arg Val Lys 
130 135 140 

Lys Val Ala Asn Lys Pro Val Ala Gly Gly Leu Arg Glu He 
145 150 155 

<210> 39 
<211> 608 
<212> PRT 

<213> Ruta graveolens 
<400> 39 

Met He Thr Leu Asn Val Glu Thr Pro Pro Leu Thr Arg Ser Gin Leu 
15 10 15 

Pro Ser Thr Phe Arg Val Ser Ser Ala Ala Ser Val Asn Phe Asn Asp 
20 25 30 

Arg Val Ala Thr Ser Arg Trp Arg Pro Asn Ser Leu Ser Leu Thr Thr 
35 40 45 

Ser Ser Tyr Arg Leu Arg Thr Leu Lys Cys Ala Ala Ser Ala Ser Thr 
50 55 60 

Ser Ala Ser Thr Ser Ala Ser Pro Ser Pro Ser Pro Ser Leu Val Asp 
65 70 75 80 

Gin Ser Ala Asn Phe His Glu Ala Ser Lys Lys Gly Asn Leu He Pro 
85 90 95 

Leu Tyr Arg Cys He Phe Ser Asp His Leu Thr Pro Val Leu Ala Tyr 
100 105 110 

Arg Cys Leu Val Lys Glu Asp Asp Arg Asp Ala Pro Ser Phe Leu Phe 
115 120 125 

Glu Ser Val Glu Pro Gly Ser Gin Ala Ser Ser He Gly Arg Tyr Ser 
130 135 140 



26 



SUBSTITUTE SHEET (RULE 26) 



BNSDOCID: <WO 994905eA2_L> 



wo 99/49058 



PCTAJS99/06046 



Val Val Gly Ala Gin Pro Ala He Glu He Val Ala Lys Glu Asn Met 
145 150 155 160 

Val Thr He Leu Asp His Glu Gly Gly Gin Arg Thr Glu Gin Phe Val 
165 170 175 

Glu Asp Pro Met Asp Val Pro Arg Arg He Met Glu Gly Trp Lys Pro 
180 185 190 

Gin Leu He Asp Glu Leu Pro Glu Ala Phe Cys Gl-y Gly Trp Val Gly 
195 200 205 

Tyr Phe Ser Tyr Asp Thr Val Arg Tyr Val Glu Lys Lys Lys Leu Pro 
210 215 220 

Phe Phe Ser Ala Pro Thr Asp Asp Arg Asn Leu Pro Asp Val His Leu 
225 230 235 240 

Glv Leu Tyr Asp Asp Val He Val Phe Asp His Val Glu Lys Lys Ala 
245 250 255 

Phe Val He His Trp Val Arg Leu Asp Gin Tyr Ser Ser Val Ala Glu 
260 265 270 

Ala Tyr Asn Asp Gly Met Asn Arg Leu Glu Asn Leu Val Ser Arg Val 
275 280 285 

His Asp He Val Pro Pro Lys Leu Arg Ser Gly Ser He Lys Leu His 
290 295 300 

Thr Arg His Phe Gly Pro Lys Leu Glu Arg Ser Ser Met Thr Ser Glu 
305 310 315 320 

Ala Tyr Lys Glu Ala Val Leu Glu Ala Lys Glu His He Leu Ala Gly 
325 330 335 

Asp He Phe Gin He Val Leu Ser Gin Arg Phe Glu Arg Arg Thr Phe 
340 345 350 

Ala Asp Pro Phe Glu He Tyr Arg Ser Leu Arg He Val Asn Pro Ser 
355 360 365 

Pro Tyr Met Thr Tyr Leu Gin Ala Arg Gly Cys He Leu Val Ala Ser 
370 375 380 

Ser Pro Glu He Leu Thr Arg Val Lys Lys Arg Lys He Thr Asn Arg 
385 390 395 400 

Pro Leu Ala Gly Thr He Arg Arg Gly Lys Thr Arg Lys Glu Asp Leu 
405 410 415 

Val Phe Glu Lys Glu Leu Leu Asn Asp Glu Lys Gin Cys Ala Glu His 
420 425 430 

He Met Leu Val Asp Leu Gly Arg Asn Asp Val Gly Lys Val Ser Glu 
435 440 445 

Pro Gly Ser Val Lys Val Glu Lys Leu Met Asn He Glu His Tyr Ser 
450 455 460 

His Val Met His He Ser Ser Thr Val Thr Gly Glu Leu Leu Asp His 
465 470 475 480 

Leu Thr Ser Trp Asp Ala Leu Arg Ala Ala Leu Pro Val Gly Thr Val 
485 490 495 
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Ser Glv Ala Pro Lys Val Lys Ala Met Glu lie He Asp Lys Leu Glu 
500 505 510 

Val Thr Arg Arg Gly Pro Tyr Gly Gly Gly Phe Gly Gly He Ser Phe 
515 520 525 

Thr Gly Asp Leu Asp He Ala Leu Ala Leu Arg Thr Met Val Phe Gin 
530 535 540 

Thr Ala Thr Arg Tyr Asp Thr Met Tyr Ser Tyr Lys- Asp Val Asp Lys 
545 550 555 560 

Arg Arg Glu Trp lie Ala His Leu Gin Ala Gly Ala Gly He Val Ala 
565 570 575 . 

AsD Ser Asp Pro Ala Asp Glu Gin Arg Glu Cys Glu Asn Lys Ala Ala 
580 585 590 

Ala Leu Ala Arg Ala He Asp Leu Ala Glu Ser Ser Phe He Glu Lys 
595 600 605 

<210> 40 
<211> 276 
<212> PRT 

<213> Arabidopsis thaliana 

<400> 40 r o 

Met Ala Ala Ser Thr Leu Tyr Lys Ser Cys Leu Leu Gin Pro Lys Ser 
15 10 15 

Gly Ser Thr Thr Arg Arg Leu Asn Pro Ser Leu Val Asn Pro Leu Thr 
20 25 30 

Asn Pro Thr Arg Val Ser Val Leu Gly Lys Ser Arg Arg Asp Val Phe 
35 40 45 

Ala Lys Ala Ser He Glu Met Ala Glu Ser Asn Ser He Pro Ser Val 
50 55 60 

Val Val Asn Ser Ser Lys Gin His Gly Pro He He Val He Asp Asn 
65 70 75 80 

Tyr Asp Ser Phe Thr Tyr Asn Leu Cys Gin Tyr Met Gly Glu Leu Gly 
85 90 95 

Cys His Phe Glu Val Tyr Arg Asn Asp Glu Leu Thr Val Glu Glu Leu 
100 105 110 

Lvs Lys Lys Asn Pro Arg Gly Val Leu He Ser Pro Gly Pro Gly Thr 
115 120 125 

Pro Gin Asp Ser Gly He Ser Leu Gin Thr Val Leu Glu Leu Gly Pro 
130 135 140 

Leu Val Pro Leu Phe Gly Val Cys Met Gly Leu Gin Cys He Gly Glu 
145 150 155 160 

Ala Phe Gly Gly Lys He Val Arg Ser Pro Phe Gly Val Met His Gly 
165 170 175 

Lvs Ser Ser Met Val His Tyr Asp Glu Lys Gly Glu Glu Gly Leu Phe 
180 185 190 

Ser Gly Leu Ser Asn Pro Phe He Val Gly Arg Tyr His Ser Leu Val 
195 200 205 
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lie Giu Lys Asp Thr Phe Pro Ser Asp Glu Leu Giu Val Thr Aia Trp 
210 215 220 

Thr Glu Asp Gly Leu Val Met Ala Ala Arg His Arg Lys Tyr Lys His 
225 230 235 240 

He Gin Gly Val Gin Phe His Pro Glu Ser lie He Thr Thr Glu Gly 
245 250 255 

Lys Thr He Val Arg Asn Phe He Lys He Val Giu Lys Lys Glu Ser 
260 265 270 

Glu Lys Leu Thr 
275 

<210> 41 

<211> 312 

<212> PRT 

<213> Arabidopsis thaiiana 

<400> 41 

Met Ala He Ala Phe Lys Ser Gly Val Phe Phe Leu Gin Ser Pro Lys 

1 5 10-15 

Ser Gin He Gly Phe Arg His Ser Ser Pro Pro Asp Ser Ser Leu Ser 
20 25 30 

Phe Lys Arg Phe Thr Pro Met Ala Ser Leu Ser Thr Ser Ser Pro Thr 
35 40 45 

Leu Gly Leu Ala Asp Thr Phe Thr Gin Leu Lys Lys Gin Gly Lys Val 
50 55 60 

Ala Phe He Pro Tyr He Thr Ala Gly Asp Pro Asp Leu Ser Thr Thr 
65 70 75 80 

Ala Glu Ala Leu Lys Val Leu Asp Ala Cys Gly Ser Asp He He Glu 
85 90 95 

Leu Gly Val Pro Tyr Ser Asp Pro Leu Ala Asp Gly Pro Val He Gin 
100 105 110 

Ala Ala Aia Thr Arg Ser Leu Glu Arg Gly Thr Asn Leu Asp Ser He 
115 120 125 

Leu Glu Met Leu Asp Lys Val Val Pro Gin He Ser Cys Pro lie Ser 
130 135 140 

Leu Phe Thr Tyr Tyr Asn Pro He Leu Lys Arg Gly Leu Gly Lys Phe 
145 150 ^ 155 160 

Met Ser Ser He Arg Ala Val Gly Val Gin Gly Leu Val Val Pro Asp 
165 170 175 

Val Pro Leu Glu Giu Thr Giu Met Leu Arg Lys Glu Ala Leu Asn Asn 
180 185 190 

Asp He Glu Leu Val Leu Leu Thr Thr Pro Thr Thr Pro Thr Giu Arg 
195 200 205 

Met Lys Leu He Val Asp Aia Ser Glu Gly Phe He Tyr Leu Val Ser 
210 215 220 

Ser He Gly Val Thr Gly Aia Arg Ser Ser Val Ser Gly Lys Val Gin 
225 230 235 240 
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Ser Leu Leu Lys Asp lie Lys Glu Ala Thr Asp Lys Pro Val Ala Val 
245 250 255 

Glv Phe Gly He Ser Lys Pro Glu His Val Lys Gin He Ala Giy Trp 
260 265 270 

Gly Ala Asp Gly Val lie Val Giy Ser Ala Met Vai Lys Leu Leu Gly 
275 280 285 

Asp Ala Lys Ser Pro Thr Glu Giy Leu Lys Glu Leu Glu Lys Leu Thr 
290 295 300 



Lys Ser Leu Lys Ser Ala Leu Leu 
305 310 



30 



SUBSTITUTE SHEET (RULE 26) 

BNSOOCIO: <WO 9949058A2_I_> 



WORLD INTELLECTUAL PROPERTY ORGANIZATION 
Intemationai Bureau 



mm 



PCX 

INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCT) 



(51) International Patent Classification ^ : 
C12N 15/60, 9/88, 1/21, C12Q 1/68 



A3 



(11) International Publication Number: WO 99/49058 

(43) international Publication Date: 30 September 1999 (30.09.99) 



(21) International Application Number: PCT/US99/06046 

(22) International Filing Date: 19 March 1999 (19.03.99) 



(30) Priority Data: 

60/079,386 



26 March 1998 (26.03.98) 



US 



(71) Applicant (for all designated States except US): E.L DU PONT 

DE NEMOURS AND COMPANY [US/US]; 1007 Market 
Street, Wilmington. DE 19898 (US). 

(72) Inventors; and 

(75) Inventors/Applicants (for US only): VOLLMER, Steven, 
J. fUS/USl; 1019 Tweedbrook Road, Wilmington, DE 
19810 (US). FALCO, Saverio. Carl [US/US]; 1902 Millers 
Road, Arden, DE 19810 (US). BROGLIE, Richard, M. 
[US/US]; 520 Port Royal Court, Landenberg, PA 19350 
(US). BRYAN, Gregory, T. [NZ/US]; 1215 Spruce Street, 
Wilmington. DE 19805 (US). CAHOON, Rebecca, E. 
[US/US]; 2331 West 1 8th Street, Wilmington, DE 19806 
(US). RAFALSKl, J., Antoni [US/US]; 2028 Longcome 
Drive. Wilmington, DE 19810 (US). 

(74) Agent: MAJARIAN, William, R.; E.L du Pont de Nemours 
and Company, Legal Patent Records Center, 1007 Market 
Street. Wilmington, DE 19898 (US). 



(81) Designated States: AL, AU, BA, SB, BG, BR. CA, CN. CU, 
CZ, EE, GD, GE, HR. HU, ID, IL, IN, IS, JP, KP, KR, 
LC, LK, LR, LT, LV, MG, MK, MN. MX, NO, NZ, PL, 
RO, S.G, SI, SK, SL, TR. TT, UA, US, UZ, VN. YU, ZA. 
ARIPO patent (GH, GM, KE, LS, MW, SD, SU SZ, UG, 
ZW), Eurasian patent (AM, AZ, BY, KG, KZ, MD, RU, TJ, 
TM), European patent (AT, BE, CH, CY, DE, DK, ES, FI, 
FR, GB, GR, IE, IT, LU, MC, NL, PT, SE), OAPI patent 
(BF, BJ, CF, CG, CI, CM, GA, GN, GW, ML, MR, NE, 
SN, TD, TG). 



Published 

iVitfi international search report. 

(88) Date of publication of the international search report: 

6 April 2000 (06.04.00) 



(54) Title: TRYPTOPHAN BIOSYNTHETIC ENZYMES 
(57) Abstract 

This invention relates to an isolated nucleic acid fragment encoding a tryptophan biosynthetic enzyme. The invention also relates 
to the construction of a chimeric gene encoding all or a portion of the tryptophan biosynthetic enzyme, in sense or antisense orientation, 
wherein expression of the chimeric gene results in production of altered levels of the tryptophan biosynthetic enzyme in a transformed host 
cell. 



BNS[X)CID: <W0 9949058A3J_> 



FOR THE PURPOSES OF INFORMATION ONLY 



Codes used to identify States party to the PCX on the front pages of pamphlets publishing international applications under the PCT. 



AL 


Albania 


ES 


Spain 


LS 


Lesotho 


SI 


Slovenia 


AM 


Armenia 


FI 


Finland 


LT 


Lithuania 


SK 


Slovakia 


AT 


Austria 


FR 


France 


hV 


Luxembourg 


SN 


Senegal 


AU 


Australia 


GA 


Gabon 


LV 


Latvia 


SZ 


Swaziland 


AZ 


Azerbaijan 


GB 


United Kingdom 


MC 


Monaco 


TD 


Chad 


BA 


Bosnia and Herzegovina 


GE 


Georgia 


MP 


Republic of Moldova 


TG 


Togo 


BB 


Barbados 


GH 


Ghana 


MG 


Madagascar 


TJ 


Tajikistan 


BE 


Belgium 


GN 


Guinea 


MK 


The former Yugoslav 


TM 


Turkmenistan 


BF 


Burkina Faso 


GR 


Greece 




Republic of Macedonia 


TR 


Turkey 


BC 


Bulgaria 


IIU 


Hungary 


ML 


Mali 


TT 


Trinidad and Tobago 


BJ 


Benin 


IE 


Ireland 


MN 


Mongolia 


UA 


Ukraine 


BR 


Brazil 


IL 


Israel 


MR 


Mauritania 


UG 


Uganda 


BY 


Belarus 


IS 


Iceland 


MW 


Malawi 


US 


United States of America 


CA 


Canada 


IT 


Italy 


MX 


Mexico 


uz 


Uzbekistan 


CF 


Central African Republic 


JP 


Japan 


NE 


Niger 


VN 


Viet Nam 


CC 


Congo 


KE 


Kenya 


NL 


Netherlands 


YU 


Yugoslavia 


CH 


Switzerland 


KG 


Kyrgyzstan 


NO 


Norway 


ZW 


Zimbabwe 


CI 


C6ic d'lvoire 


KP 


Democratic People's 


NZ 


New Zealand 






CM 


Cameroon 




Republic of Korea 


PL 


Poland 






CN 


China 


KR 


Republic of Korea 


PT 


Portugal 






CU 


Cuba 


KZ 


Kazakstan 


RO 


Romania 






CZ 


Czech Republic 


LC 


Saint Lucia 


RU 


Russian Federation 






DE 


Germany 


LI 


Liechtenstein 


SD 


Sudan 






I)K 


Denmark 


LK 


Sri Lanka 


SE 


Sweden 








Ostonia 


LR 


Liberia 


SG 


Singapore 







BNSDOCID: <WO 9949058A3J_> 



INTERNATIONAL SEARCH REPORT 




fnte onat Application No 




PCT/US 99/06046 



A. CLASSIFICATION OF SUBJECT MATTER 

IPC 6 C12N15/60 C12N9/38 C12N1/21 C12Q1/68 



Accoroing to iniernationat Pateni Ciassiiicjiion iiFC- or ;o coth nanonai classification ana tPC 

B. FIELDS SEARCHED 

Mtnimum documentation searcned tctassification system toilowea Dy ciassittcatton symooisi 

IPC 6 C12N C120 



Documentation searcheo otner man minimum documentation to the extent that sucn documents are included in tne fields seafcned 



Elecironic data base consulted during the international search tname ot data toase and. where oractical. search terms usea) 



C. DOCUMENTS CONSIDERED TO BE RELEVANT 



Category ' 



Citation o( document, with indication, where appropriate, ot the relevant passages 



Relevant to claim No. 



wo 97 26366 A (DEKALB GENETICS CORP 

; ANDERSON PAUL C (US); CHOMET PAUL S (US): 

GR) 24 July 1997 (1997-07-24) 

the whole document 

-/- 



1-34, 
54-59 



j )( I Further documents are listed in the continuation of box C. 



ID 



Patent family members are listed in annex. 



* Special categories or cited documents : 

"A" document defining the general state of the art which is not 

considered to be ol particular relevance 
'E" earlier document but published on or after the international 

tiling date 

"L" document which may throw doubts on pnority claim(s) or 
which is cited to establish the publication date ot another 
Citation or other special reason (as specitied) 

"O" document referring to an oral disclosure, use. exhibition or 
ether means 

"P" document published prior to the international tiling date but 
later than the pnonty date claimed 



"T" later document published after the international filing date 
or pnonty date and not in conflict with the application but 
cited to understand the principle or theory underlying the 
invention 

'X" document of particular relevance: the claimed invention 
cannot be considered novel or cannot be considered to 
involve an inventive step when the document is taken alone 

'Y" document of particular relevance: the claimed invention 
cannot be considered to involve an inventive step when the 
document is combined with one or more other such docu- 
ments, such combination being obvious to a person skilled 
in the art. 

document member of the same patent family 



Date ot the actual completion ot the international search 

18 January 2000 


Date ot mailing of the international search repon 

0 3. 02. 00 


Name ana mailing address ol the ISA 

European Patent OHice. P.B. 5818 Patentlaan 2 
NL - 2280 HV Rijswijk 
Tef. (*3l-70) 340-2040. Tx. 31 651 epo nl. 
Fax: (+3 1-70) 340-3016 


Authonzed officer 

Smalt, R 



Form PCT/ISA/210 (socomj sheen (July 19921 



• BNSDOCID: <WO_9949058A3J_ 



page 1 of 4 



INTERNATIONAL SEARCH REPORT 



jnal Application No 



PCT/US 99/06046 



C.lContinuattoni DOCUMENTS CONSIDERED TO BE RELEVANT 



Station 01 -jccumsnt. Aim inoicaucn vm-fro accrconoi? -r^ r^i^vani po-isages 



. ^^levant ;o tutm no 



DATABASE EMBL - EMEST20 -Online I 

Entry ZM248. Acc.no.T25248. 

1 October 1994 (1994-10-01) 

SHEN, 8. ETAL.: "5clOh08 membrane-free 

polysomes from endosperm Zea mays cONA 

clone 5cl0h08 5" end similar to 

anthranilate synthase." 

XP0021I2345 

the whole document 

-& SHEN, B. ET AL.: "Partial sequencing 
and mapping of clones from two maize cDNA 
1 ibraries . " 

PLANT MOLECULAR BIOLOGY, 

vol. 26, 1994, pages 1085-1101, 

XP002042536 

the whole document 

BOHLMANN J ET AL: "PURIFICATION AND CDNA 
CLONING OF ANTHRANILATE SYNTHASE FROM RUTA 
GRAVEOLENS, MODES OF EXPRESSION AND 
PROPERTIES OF NATIVE AND RECOMBINANT 
ENZYMES" 

PLANT JOURNAL. GB,BLACKWELL SCIENTIFIC 

PUBLICATIONS, OXFORD, 

vol. 7, no. 3, page 491-501 XP002032323 

ISSN: 0960-7412 
the whole document 

TOZAWA, Y. ET AL. : "Isolation and 
analysis of gene coding for alpha-subunit 
of rice anthranilate synthase, (in 
Japanese)" 

JAPANESE JOURNAL OF BREEDING, 
vol. 46, no. 2, 

27 September 1996 (1996-09-27), page 28 
XP002917196 

see whole document; cited in search report 

of W09911800 (first citation). 

the whole document 

-& DATABASE EMBL - TREMBL 'Online! 

Entry/acc.no. Q9XJ29, 

1 November 1999 (1999-11-01) 

TOZAWA, Y. ET AL.: "Anthranilate synthase 

alpha 2 subunit." 

XP002127895 

the whole document 

-/-- 



1,2.10. 

18.22. 

23.28 



1,5,6, 
13,14 



4,9,11, 
18 



Form PCT/lSA/210 (continuaiion ol second sheen (July 1992) 
BNSDOCID: <WO_9949058A3J_> 



page 2 of 4 



5 



INTERNATIONAL SEARCH REPORT 

inte onai Apf 


xication No 


PCT/US 99 


/06046 


C.(Cominuat(on) DOCUMENTS CONSiOEHED TO BE RELEVANT 






^Alavani ;o •::aim Nc 


X ; 


DATABASE EMBL - EMPLN -Online! 


1 22-25.27 




Entry ZMORF. Acc.no.M95067. 1 




9 June 1992 (1992-06-09) 




KEITH, C.S. ET AL.: "Zea mays putative 1 




anthramlate synthase component II homolog 


- 




mRNA, partial cds 






XP002112490 






the whole document 






-& KEITH, C.S. ETAL.: "Partial sequence 






analysis of 130 randomly selected maize 






cONA clones." 






PLANT PHYSIOLOGY. 






vol. 101, 1993, pages 329-32. XP002112488 






the whole document 

— 




X 


NIYOGI K K ET AL: "SUPPRRESSORS OF TRPl 


1-4 , 




FLUORESCENCE IDENTIFY A NEW ARABIDOPSiS 


10-12, 




GENE, TRP4, ENCODING THE ANTHRANILATE 


22-26, 




SYNTHASE BETA SUBUNIT" 


29-34 




PLANT CELL, 






vol. 5, 1 September 1993 (1993-09-01), 






pages 1011-1027, XP002032325 






ISSN: 1040-4651 






abstract; figures 1,7,8 




X 


DATABASE EMBL - EMEST20 'Online! 


22,23.27 




Entrv ZM7731 Acc no T8773 






14 May 1994 (1994-05-14) 






SHEN, B. ETAL.: "2C06H01-T7 membrane-free 






polysomes from endosperm Zea mays cONA 






clone 2C06H01 5' end similar to 






glucocorticoid receptor." 






XP002112491 






the whole document 






-& SHEN, B, ET AL.: "Partial sequencing 






and mapping of clones from two maize cDNA 






1 ibraries . " 






PLANT MOLECULAR BIOLOGY, 






vol 26 November 1994 (1994-11) oaaes 






1085-1101, XP002042536 






the whole document 




X 


DATABASE EMBL - EHEST3 -Online! 


22,24,28 




Entry/Acc.No. AA751371. 






21 January 1998 (1998-01-21) 






NAHM, B.H. ET AL.: "ISYSOOS? Rice Immature 






Seed Lambda ZAPII cDNA Library Oryza 






sativa cDNA clone ISYS0037 similar to 






MZEORFH Zea mays putative anthranilate 






synthase component II homolog mRNA, 






partial cds." 






XP002112492 






the whole document 






-/-- 





Forni PCT/lSA/210 (connnuaoon ot second sheet) (July 19921 



page 3 of 4 

BNSDOCID: <WO 994905aA3J_> 



INTERNATIONAL SEARCH REPORT 



intei :;nai Application No 

PCT/US 99/06046 



CtContinuGtion) DOCUMENTS CONSIDERED TO BE RELEVANT 




';:ation ot accijment. Aitn mcicajion. Arero aoprODnaie. :t in-? r^wvant oassag^'S 


^.-sl-ivani ro ~:<nrr\ No 


X i 


RADWANSKI, E. R. ET AL. : "Arabidopsi 3 | 


35-53 




thalia tryptophan synthase alpha: gene j 






cloning, expression, and subunit 






interaction." 






MOLECULAR AND GENERAL GENETICS. 






vol. 248, 1995, pages 657-67, XP002127894 






the whole document 




X 


l/DAMTD \l r n Al . "CTDlirTllDC AC A MAT7C 

KRAMER V C ET AL; SIKULIUKt Ur A miLt 






TRYPTOPHAN SYNTHASE ALPHA SUBUNIT GENE 


39-42 




WITH PITH ENHANCED EXPRESSION" 






PLANT MOLECULAR BIOLOGY, NL.NIJHOFF 






PUBLISHERS, DORDRECHT. 






vol. 27, 1995. pages 1183-1188-8. 






XP002G71677 






ISSN: 0167-4412 




Y 


abstract; figure 1 


JO 


Y 


DATABASE EMBL - EMEST22 'Online! 


38 




Entry/Acc . no . C27627, 






6 August 1997 (1997-08-06) 






YAMAMOTO, K. ET AL.; "Rice cDNA, partial 






sequence. " 






XP002127896 






the whole document 




X 


DATABASE EMBL - EMEST2 'Online! 


35,40 




Entry/Acc. no. AA660642, 






14 November 1997 (1997-11-14) 






COVITZ, P. A. ET AL.; "00530 MtRHE Medicago 






truncatula cDNA 5' similar to tryptophan 






synthase alpha chain." 






XP002127897 






the whole document 




P,X 


WO 99 11800 A (HASEGAWA HISAKAZU ;HOKKO 


1-13, 


CHEM IND CO (JP); TERAKAWA TERUHIKO (JP);) 


18-21. 




11 March 1999 (1999-03-11) 


54-58 




page 125 -page 131; claims 1,2 






-& DATABASE WPI 






Section Ch, Week 9919 






Derwent Publications Ltd.. London. GB; 






Class COS, AN 99-228982 






XP002112346 






the whole document 





Form PCT/1SA/2 10 (continuation ot second sheen (July 1 992) 
BNSDOCID: <WO 9949056A3_L> 



page 4 of 4 



INTERNATIONAL SEARCH REPORT 



International application No. 

PCT/US 99/06046 



Box 1 Observations where certain claims were found unsearchable (Continuation of item 1 of first sheet) 



This International Search Report has not been established in respect of certain claims under Article I7(2){a) for the following reasons: 
1 . I I Claims Nos.; 

because they relate to subject matter not required to be searched by this Authority, namely: 



Claims Nos.: 57,58 

because they relate to parts of the International Application that do not comply with the presented requirements to such 
an extent that no meaningful International Search can be canied out, specifically: 

see FURTHER INFORMATION sheet PCT/ISA/210 



3. I I Claims Nos.: 

because they are dependent claims and are not drafted in accordance with the second and third sentences of Rule 6.4(a). 



Box II Observations where unity of invention is lacking (Continuation of Item 2 of first sheet) 

This International Searching Authority found multiple inventions in Ihis international application, as follows: 



see additional sheet 



1 . As all required additional search fees were timely paid by the applicant, this International Search Report covers all 

searchable claims 



2. I j As all searchable claims could be searched without effort justifying an additional fee, this Authority did not invite payment 

of any additional fee. 



□ As only some of the required additional search fees were timely paid by the applicant, this International Search Report 
covers only those claims for which fees were paid, specifically claims Nos.: 



4. I I No required additional search fees were timely paid by the applicant. Consequently, this International Search Report is 
restricted to the invention first mentioned in the claims; it is covered by claims Nos.: 



Remark on Protest The additional search fees were accompanied by the applicant's protest. 

j ^ j (Mo protest accompanied the payment of additional search fees. 
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International Application No. PCT/US 99/06Q46 



FURTHER INFORMATION CONTINUED FROM PCT/ISA/ 210 



Continuation of Box i.2 
Claims Nos-: 57,58 



Present claims 57 and 58, relating to products which can be obtained by 
the methods of claims 55 and 55, respectively, have not been searched due 
to lack of support for such products within the meaning of Article 6 POT 
and/or lack of disclosure of such products within the meaning of Article 
5 PCT in the description. 

The applicant's attention is drawn to the fact that claims, or parts of 
claims, relating to inventions in respect of which no international 
search report has been established need not be the subject of an 
international preliminary examination (Rule 56.1(e) PCT], The applicant 
is advised that the EPO policy when acting as an International 
Preliminary Examining Authority is normally not to carry out a 
preliminary examination on matter which has not been searched. This is 
the case irrespective of whether or not the claims are amended following 
receipt of the search report or during any Chapter II procedure. 
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FURTHER INFORMATION CONTINUED FROM PCT/ISA/ 210 



This International Searching Authority found multiple (groups of) 
inventions in this international application, as follows: 

1. Claims: 2,3,10,11,23,24,27,28 completely, and 1,18-22, 
31-34,54-59 partially 



Corn anthranilate synthase subunits alpha and beta, portions 
thereof, nucleic acids encoding them and sequences with at 
least 80% homology thereto, chimeric genes comprising a 
portion of said nucleic acids, hosts transformed with said 
nucleic acids, method for altering expression of said gene, 
method for obtaining said nucleic acids, and method for 
evaluating the ability of a compound to inhibit the activity 
of said protein. 

2. Claims: 36,37,43,44 completely, and 35,50-59 partially 

As invention 1, but limited to corn tryptophan synthase 
subunit alpha. 

3. Claims: 4,12,25,29 completely, and 1,18-22, 31-34, 

54-59 partially 

As invention 1, but limited to rice anthranylate synthase 
subunits alpha and beta. 

4. Claims: 38,39,45,46 completely, and 35,50-59 partially 

As invention 1, but limited to rice tryptophan synthase 
subunit alpha. 

5. Claims: 5,6,13,14,40,47 completely, and 1,18-21,35, 

50-59 partially 

As invention 1, but limited to soybean anthranylate synthase 
subunit alpha and tryptophane synthase subunit alpha. 



6. Claims: 7-9,15-17,25,30,41,42,48,49 completely, and 1, 
18-22,31-35,50-59 partially 

As invention 1, but limited to wheat anthranylate synthase 
subunits alpha and beta, and tryptophane synthase subunit 
alpha. 
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